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EDITORIAL 


Learning to deal with dual use 


iological research is profoundly valuable but can 
carry profound risks. The coronavirus outbreak 
reminds us of our vulnerability to biological 
threats and that research on pathogens is vital 
to threat mitigation. But such research can lead 
to catastrophic safety and security incidents. 
A global proliferation of tools and capabilities, 
driven by economic and national security interests, is 
also generating risks that fall outside current gover- 
nance frameworks. We must learn to manage risks as 
quickly as we learn to manipulate life, but it remains 
unclear how well we are doing. One opportunity to 
learn is found in a new charge to the U.S. National Sci- 
ence Advisory Board for Biosecurity (NSABB), which 
reconvened in January after a 2-year hiatus, to assess 
the effectiveness of U.S. dual-use research oversight 
policies. To meet the charge and 
suggest improvements, the NSABB 
must address a neglected need for 
an evidence base for adaptive risk 
management. Building this evi- 
dence base will require revisiting 
success criteria, creating data in- 
frastructures, and fostering adap- 
tive policies and testbeds. 

Clear criteria for success form 
the foundation of an evidence 
base. Criteria for policies designed 
to prevent rare incidents can be 
challenging to develop, but there 
are leading indicators that can 
validate design goals and assump- 
tions. These can range from enhanced stakeholder 
participation and expressions of trust to reductions 
in near misses. Such criteria form the basis for sound 
policy design and implementation and must be regu- 
larly revisited. 

The next step is to collect and analyze data to assess 
whether goals were met. A 2011 U.S. government-com- 
missioned risk and benefit analysis of gain-of-function 
research exposed the dearth of data available to inform 
many risk management decisions. The NSABB and other 
groups have called—to no avail—for systems to collect 
and analyze data on incidents and near-misses as well 
as oversight. Failure of the U.S. government to quickly 
act on these recommendations means that the NSABB 
will again confront this data gap. Moreover, biosafety 
and biosecurity research, which seeks to understand and 
improve practices for risk management, remains chroni- 
cally underfunded despite being called out as a priority 
in the U.S. National Biodefense Strategy. This research 


SCIENCE sciencemag.org 


“We must learn 
to manage risks 


as quickly 
as we learn to 
manipulate life...” 


should be supported as an essential part of all major 
biological research programs so that it evolves alongside 
and is integrated with the latest advances. 

Unfortunately, even when limited data are col- 
lected, they are often not shared lest they expose vul- 
nerabilities or tarnish reputations. Legitimate security 
concerns abound, but an unknown and growing risk 
exposure is also dangerous. Many organizations do not 
fully exercise their ability to share details of their risk 
management process; they should do so to promote 
cross-organizational learning and to avoid repeating 
the same mistakes. 

Policies designed to adapt to emerging evidence can 
provide standards and incentives for gathering data. Yet 
dual-use policies are typically narrowly scoped to pri- 
oritize limiting oversight burden over learning where 
concerns arise. This priority is mis- 
placed in preparing for the future. 
As the scope of research broadens, 
our definitions of dual use will 
change, and we need to continue 
moving toward functional defini- 
tions of activities that could be 
hazardous that aren’t reliant on 
quickly outdated lists. In many in- 
stances where there is uncertainty 
over appropriate scope, we should 
update policies regularly and pri- 
oritize observing activities over re- 
stricting them to gain visibility into 
the evolving landscape. 

Communities of practitioners 
need testbeds to experiment with policy approaches 
and to learn from each other in real time. One example 
is the international Genetically Engineered Machine 
(iGEM) competition, where thousands of students 
from dozens of countries work on hundreds of projects 
that can often expose policy gaps. The competition 
has becomes a nexus to test strategies for managing 
dual-use technologies across many cases with yearly 
iteration. Such testbeds should be cultivated at many 
scales—from individual labs, to universities, to re- 
gional and national networks. 

Biological science and technology will only become 
more essential to our societies, and we need strategies 
to learn to manage their power. Investing in an evi- 
dence base for adaptive risk management is essential to 
ensure that the future of the life sciences is one in which 
we want to live. 


—Megan J. Palmer 
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National Institute of Allergy and Infectious Diseases Director Anthony Fauci, to Politico last week 
on the “fine balance” needed for a credible U.S. government response to the coronavirus crisis. 


’ Edited by Catherine Matacic 


An artist's rendition of the Wide Field Infrared Survey Telescope. 


NASA survey scope wins approval 


ASA’s Wide Field Infrared Survey Telescope (WFIRST), a big 

orbiting observatory planned for launch in the mid-2020s, 

has reached a major milestone. On 2 March, the agency said 

the project passed its confirmation review and could begin to 

build hardware. Thanks to a 2.4-meter mirror donated in 2011 

by the U.S. Department of Defense’s National Reconnaissance 
Office, WFIRST will peer deep into the cosmos with the sensitivity 
of the Hubble Space Telescope, but 100 times the field of view. It 
will track the shapes, positions, and distances of millions of galaxies 
to understand how dark matter influenced their formation and how 
dark energy is boosting cosmic expansion. It will also carry a corona- 
graph, a mask to block out a star’s light so that orbiting exoplanets 
can be seen directly. To keep within its $3.2 billion budget, the project 
eased the requirements of the coronagraph and its attached spectro- 
graph (Science, 27 October 2017, p. 433). Last month, the Trump ad- 
ministration proposed killing the project in its 2021 budget request, 
but astronomers hope Congress will reverse that, as it did after simi- 
lar budget requests for 2019 and 2020. 


1058 6 MARCH 2020 « VOL 367 ISSUE 6482 


Published by AAAS 


Coronavirus KOs major meeting 


MEETINGS | Citing the spread of the new 
coronavirus, the American Physical Society 
(APS) abruptly canceled its largest meeting 
of the year, just 36 hours before more than 
10,000 physicists were to convene this 
week in Denver. APS made the decision on 
29 February, hours after the U.S. Centers 
for Disease Control and Prevention advised 
against nonessential travel to South Korea 
and Italy, APS CEO Kate Kirby told Science. 
More than 500 people were registered from 
those countries. Although the cancella- 
tion inconvenienced many, Kirby says APS 
leaders were concerned about the health 
of registrants, vendors, and local residents. 
They had also envisioned a “nightmare 
scenario” in which a few infected attendees 
might cause thousands to be quarantined. 
As Science went to press, many major con- 
ferences were still on, including the spring 
meeting of the American Chemical Society 
and the European Congress of Clinical 
Microbiology and Infectious Diseases. 


Earth’s fly-by-night visitor 


PLANETARY SCIENCE | Some 18 months 
ago, Earth’s gravity roped in a car-size aster- 
oid, making it the planet’s second detected 
“minimoon,” astronomers reported last 
week. (Another captured asteroid orbited 
Earth from 2006 to 2007.) Discovered 

by Kacper Wierzchos and Theodore Pruyne 
of the Catalina Sky Survey at the Mount 
Lemmon Observatory and provisionally 
dubbed 2020 CD3, the rock, of unknown 
composition, likely won’t stick around: CD3 
has a wobbly orbit that in a few weeks will 
fling it from Earth and return it to its orbit 
around the Sun. Smaller minimoons occur 
frequently, scientists think, but visitors the 
size of CD3 may come only once a decade. 


Trials must post missing data 


CLINICAL TRIALS | Drug companies, device 
manufacturers, and universities must 
make public data from hundreds of clinical 
trials conducted in the United States from 
2007-17, a federal judge ruled last week. 
The ruling, from the Southern District of 
New York, says that government agencies 
including the National Institutes of Health, 
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Glow-in-the-light amphibians , .. 
ost salamanders don't stand out: Their : 
mottled, earth-toned skin helps them 
blend into the background of forests and @ * = 
streams around the world. But shine the 
right type of light on them, and they light 

up like glow sticks. That’s the finding of a new x 

study, which reveals for the first time that most 

amphibians—from salamanders to frogs—have 

fluorescent compounds in their skin, bones, 

and even urine that absorb the surrounding light 

and re-emit it at specific wavelengths. When 

researchers put 32 species of amphibians under 

a blue or ultraviolet light, all were biofluores- 

cent, emitting a greenish to yellow light from 

their skin, they reported last week in Scientific 

Reports. This widespread occurrence suggests 

biofluorescence appeared early in the evolution- - 

ary history of amphibians. 


An eastern tiger salamander exposed to regular light 
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(top) and blue light (bottom). 


the Food and Drug Administration, and the 
Department of Health and Human Services 
for years misinterpreted a law requiring 
them to make clinical trial sponsors post 
the data to ClinicalTrials.gov. If the ruling 
is upheld, it will likely discourage drug 
companies from keeping unfavorable 
results from the public, and it could offer 
vital information for patients and doctors. 
Still unclear is how quickly the agencies 
might move to fill in the 10-year gap in 
compliance—and what the consequences 
would be for clinical trial sponsors that 
don’t comply. 


Canada steps up whale safegu 


CONSERVATION | With only about 400 
North Atlantic right whales (Eubalaena 
glacialis) remaining, Canada is stepping 
up its efforts to keep them safe from ships 
and dangerous fishing gear, even as it 
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tries to keep its lucrative snow crab and 
lobster fisheries thriving. New regula- 
tions announced last week expand the 
area in which whales will be protected by 
“dynamic” restrictions—which trigger only 
after the animals have been spotted—to the 
entire Gulf of St. Lawrence and the Bay of 
Fundy. The new restrictions should give 
fishers more harvesting opportunities than 
seasonlong shutdowns. But it also means 
fishing gear could already be in the water 
when the whales arrive. “That creates a 
huge risk,” says Kristen Monsell, an attor- 
ney for the Center for Biological Diversity. 


Wind power tops water in U.S. 


ENERGY | Last year, wind energy for the 
first time surpassed hydropower as the top 
source of renewable electricity genera- 
tion in the United States, according to a 
report out last month from the U.S. Energy 
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Information Administration. In 2019, wind 
turbines produced 300 million megawatt 
hours (MWh), compared with 274 million 
MWh from dams and 146 million MWh 
from all other renewables. The United 
States is second only to China in wind 
energy production; that country generated 
more than 366 million MWh in 2018, the 
last year for which statistics are available. 
Globally, hydropower continues to be the 
largest source of renewable energy. 


A satellite that foils space junk 


SPACE SCIENCE | One Satellite docked 
autonomously last week with another, der- 
elict one and rescued it, a milestone for a 
spaceflight industry increasingly concerned 
with space junk. Some 36,000 kilome- 

ters above Earth, Northrop Grumman’s 
Mission Extension Vehicle (MEV-1) latched 
onto Intelsat 901 (IS-901), a 19-year-old 


6 MARCH 2020 « VOL 367 ISSUE 6482 1059 


NEWS | IN BRIEF 


a 


When English is a barrier 
English is the lingua franca of modern 
science. But for the millions of scientists 
whose mother tongue isn’t English, the 
language can be a barrier to publishing 
papers, presenting at conferences, and 
even reviewing the scientific literature. 
Valeria Ramirez Castafieda, a Ph.D. student 
in biology at the University of California, 
Berkeley, who grew up in Colombia and 
studied science communication in Spain, 
decided to survey fellow researchers about 
publishing in a language that is not their 
own. Her findings were posted last month 
on the preprint server bioRxiv. Science 
spoke (in Spanish) to Ramirez Castafieda 
about the problem, and possible solutions. 


Q: How did you come up with this idea? 

A: When | was in university in Colombia, 

| was privileged to be able to read 

in English—most of my classmates 
understood little to nothing. While doing 
my master’s, | had the choice of writing my 
thesis in English. | tried, but in the end, | 
couldn't do it. It was very frustrating. And | 
knew many other students felt the same. 


Q: What did you find? 

A: Of 49 students, 43.5% said they had 
papers rejected because of English. 

On average, they spent 10 more days 
preparing a manuscript in English than 
they did in Spanish; 33% of them didn’t 
attend meetings where they were required 
to present in English. Beyond the numbers, 
the response has been very emotional. 
Colleagues have told me they thought of 
leaving science ...and some haven't been 
able to graduate because they cannot pay 
for English courses. It’s emotionally hard 
to realize. Everyone deserves and has the 
right to be a scientist. 


Q: What should scientists do? 

A: Right now, the responsibility falls on 
individuals to learn English, no matter 
where they were born, whether they're 
poor, or whether they have trouble learning 
another language. And that cannot happen 
anymore! Institutions, scientific publishers, 
government agencies, professors, and 
meeting organizers must provide solutions. 
Most already exist: including proofreading 
services in publishing fees, promoting 
spaces in publications for second 
languages, providing editing and university 
training to science students, and hiring 
[interpreters] at international meetings. 


S SCIENCEMAG.ORG/NEWS 
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communications satellite nearly out of 
propellant. For a cost of about $13 million 
annually, MEV-1 will serve as a sort of 
jetpack, keeping IS-901 in a geostation- 
ary orbit. After 5 years, MEV-1 will move 
IS-901 to a higher “graveyard” orbit, release 
its companion, and attach itself to a new 
satellite. More than 5000 satellites orbit 
Earth, but just 40% are still operational. 
Researchers have been testing different 
ways to collect or eliminate space junk, 
including nets, harpoons, and lasers. 


Constant unmoved by black hole 


ASTROPHYSICS | The constant of nature 
that sets the strength of the electromag- 
netic force isn’t altered by the gravity of 

a supermassive black hole, astronomers 
reported last week in Physical Review 
Letters. Researchers observed five stars 
orbiting the black hole in the center of our 
galaxy, a behemoth 4 million times as mas- 
sive as the Sun. In the stars’ atmospheres, 
elements including sodium, titanium, 

and silicon absorb certain wavelengths of 
starlight, creating dark lines in the stars’ 
spectra. The lines’ spacing depends on the 
strength of the electromagnetic force and 
the fine structure constant. Researchers 


SCIENCE POLICY 


And the Vannevar goes to... 


he U.S. National Science Foundation 
(NSF) owes a lot to Vannevar Bush, 
whose seminal 1945 report made 
the case for federal funding of aca- 


demic research. So when NSF officials 
learned that a bronze sculpture of the former 
Massachusetts Institute of Technology engi- 
neering dean was tucked away in a warehouse 


of the Smithsonian Institution, they wanted 
to bring it to NSF's new headquarters in 


Alexandria, Virginia. But Smithsonian officials 
said NSF didn't have the proper environmental 
controls needed to house the 57-centimeter- 


tall statue, made for a 1944 exhibit on 

“50 notable men of wartime.” NSF applied 
some of the “Yankee ingenuity” Bush had 
lauded in his report, Science, The Endless 
Frontier. Using 3D printing, it replicated the 


sculpture (right) and installed it in the office 


of NSF Director France Cérdova. And now, 


it’s making miniature versions. Last week, as 


the National Academy of Sciences hosted a 


75th anniversary celebration of Bush's report, 


Cérdova presented NAS President Marcia 


McNutt with the first “Vannevar,” calling it the 


scientific equivalent of an Oscar. 
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found no change in their spacing, putting 
a dent in speculative theories in which the 
fine structure constant could change in 
intense gravitational fields. 


Freeman Dyson dies at 96 


PHysics | Theoretical physicist, futurist, 
and writer Freeman Dyson died last week in 
Princeton, New Jersey, where he spent his 
career at the Institute for Advanced Study. 
Born and raised in the United Kingdom, 
Dyson established himself in 1949, when he 
proved that two competing quantum theo- 
ries of electricity, magnetism, and light—one 
more practical and ad hoc and the other 
more comprehensive but unwieldy—were 
equivalent. But he did not share in the 1965 
Nobel Prize in Physics, which honored the 
theory of quantum electrodynamics. In 

the late 1950s, Dyson joined Project Orion 
to design a spaceship propelled by atomic 
bombs, a goal abandoned after a 1963 arms 
control treaty banned nuclear explosions 

in space. In 1960, he mused that an alien 
civilization might surround its star with an 
array of energy absorbers, later known as a 
“Dyson sphere,’ detectable from afar. In his 
later years, he angered peers by denying and 
downplaying the threat of climate change. 
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Haein Ree 


Bruce Aylward 


Bruce Aylward of the World Health Organization holds up a graphic showing China’s plummeting coronavirus cases at a 24 February press conference in Beijing. 


Can China’s COVID-19 strategy work elsewhere? 


Rapid decline in cases is real, expert mission concludes—but it came at a high cost 


By Kai Kupferschmidt and Jon Cohen 


hinese hospitals overflowing with 

COVID-19 patients a few weeks ago 

now have empty beds. Trials of ex- 

perimental drugs can’t find enough 

eligible patients. And the number of 

new cases reported each day in China 
is dropping precipitously. 

These are some of the startling observa- 
tions in a report released on 28 February by 
a team of 12 Chinese and 13 foreign scien- 
tists who toured five cities in China to study 
the state of the COVID-19 epidemic and 
the effectiveness of the country’s response. 
Even some on the team, organized jointly by 
the World Health Organization (WHO) and 
the Chinese government, say they were sur- 
prised. “I thought there was no way those 
numbers could be real,” says epidemiologist 
Tim Eckmanns of the Robert Koch Institute 
in Berlin. 

But the report is unequivocal. “China’s 
bold approach to contain the rapid spread 
of this new respiratory pathogen has 
changed the course of a rapidly escalat- 
ing and deadly epidemic,” it says. To Bruce 
Aylward, a Canadian WHO epidemiologist 
who led the mission and briefed journalists 
in Beijing and Geneva last week, the effort 
was a huge success. “Hundreds of thousands 
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of people in China did not get COVID-19 be- 
cause of this aggressive response,” he says. 

Aylward and other members of the task 
force say the rest of the world should learn 
from China. But critics say the report failed 
to acknowledge the human rights costs 
of the most severe measures imposed by 
China’s authoritarian government: massive 
lockdowns and electronic surveillance of 
millions of people. “I think there are very 
good reasons for countries to hesitate us- 
ing these kinds of extreme measures,” says 
Lawrence Gostin, a global health law scholar 
at Georgetown University. Many also worry 
that a resurgence of the disease will occur 
after the country lifts some of its strictest 
control measures and restarts its economy, 
which has taken a huge hit. 

The report comes at a critical time in 
what many epidemiologists now consider a 
nascent pandemic. The number of affected 
countries is rising rapidly—it stood at 72 as 
Science went to press, according to WHO. 
Alarmingly, in many of these countries, the 
virus has quickly gained a foothold and 
started to spread in communities. 

But cases have plummeted in China. 
On 10 February, the first day of the mis- 
sion, the country reported 2478 new cases. 
Two weeks later, when the foreign experts 
packed their bags, the daily number of new 
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cases had dropped to 409. (On 3 March it 
had dropped further to 129 new cases, com- 
pared with 1848 in the rest of the world.) 
China’s epidemic appears to have peaked in 
late January, according to the report. 

Members of the team traveled to Bei- 
jing, Shenzhen, Guangzhou, Chengdu, and 
the hardest hit city, Wuhan. They visited 
hospitals, laboratories, companies, live 
animal markets, train stations, and local 
government offices. “Everywhere you went, 
anyone you spoke to, there was a sense of 
responsibility and collective action—and 
there’s a war footing to get things done,” 
Aylward says. 

As part of the effort, Chinese scientists 
have compiled a massive data set that gives 
the best available picture of the disease. 
The mission report says about 80% of in- 
fected people had mild to moderate disease, 
marked by fever and a dry cough; 13.8% 
had severe symptoms; and 6.1% had life- 
threatening episodes of respiratory failure, 
septic shock, or organ failure. The case fa- 
tality rate was highest for people over age 
80 (21.9%), and people who had heart dis- 
ease, diabetes, or hypertension, but 3.8% 
overall. Children made up a mere 2.4% of 
the cases, and almost none was severely ill. 
People with mild and moderate illness took 
2 weeks on average to recover. 
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The report highlights how China achieved 
what many public health experts thought 
was impossible: containing the spread of a 
widely circulating respiratory virus. “China 
has rolled out perhaps the most ambitious, 
agile, and aggressive disease containment 
effort in history,’ the report notes. The most 
dramatic—and controversial—measure was 
the lockdown of Wuhan and nearby cities in 
Hubei province, putting at least 50 million 
people under a mandatory quarantine since 
23 January. That has “effectively prevented 
further exportation of infected individuals 
to the rest of the country,’ the report con- 
cludes. Most of China did not face such se- 
vere measures: People were asked, but not 
required, to quarantine themselves if they 
felt ill, and neighborhood leaders moni- 
tored their movements. 

Chinese authorities also built two dedi- 
cated hospitals in Wuhan in about 1 week, 
sent health care workers from all over 
China to Hubei, and launched an unprece- 
dented effort to trace contacts of confirmed 
cases. In Wuhan alone, more than 1800 
teams traced tens of thousands of contacts. 
Aggressive “social distancing” measures 
implemented in the entire country included 
canceling sporting events and shuttering 
theaters, schools, and businesses. Anyone 
who went outdoors had to wear a mask. 

Two widely used mobile phone apps, 
AliPay and WeChat—which in recent years 
have replaced cash in China—have helped 
enforce the restrictions, because they allow 
the government to keep track of people’s 
movements and even stop people with con- 
firmed infections from traveling. “Every 
person has sort of a traffic light system,” 
says mission member Gabriel Leung, dean 
of the Li Ka Shing Faculty of Medicine at 
the University of Hong Kong. Color codes 
on mobile phone screens—in which green, 
yellow, or red designate a person’s health 
status—let guards at train stations and 
other checkpoints know who to let through. 

“As a consequence of all of these mea- 
sures, public life is very reduced,” the report 
notes. But the measures did work. In the 
end, infected people rarely spread the vi- 
rus to anyone except members of their own 
household, Leung says. Once all the people 
living together were exposed, the virus had 
nowhere else to go and chains of transmis- 
sion ended. “That’s how the epidemic truly 
came under control,” Leung says. 

It’s debatable how much of this could be 
done elsewhere. “China is unique in that it 
has a political system that can gain pub- 
lic compliance with extreme measures,” 
Gostin says. The country also has an ex- 
traordinary ability to do labor-intensive, 
large-scale projects quickly, says Jeremy 
Konyndyk, a senior policy fellow at the 
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Center for Global Development: “No one 
else in the world really can do what China 
just did” 

Nor should they, says lawyer Alexandra 
Phelan, a China specialist at Georgetown’s 
Center for Global Health Science and Secu- 
rity. “There are plenty of things that would 
work to stop an outbreak that we would 
consider abhorrent in a just and free soci- 
ety,’ Phelan says. 

The report urges China “to more clearly 
communicate key data and developments 
internationally.” But it is mum on the co- 
ercive nature of China’s control measures 
and the toll they have exacted. “The one 
thing that’s completely glossed over is the 
whole human rights dimension,” says Devi 
Sridhar, a global public health specialist at 
the University of Edinburgh. Instead, the 
report praises the “deep commitment of the 
Chinese people to collective action in the 
face of this common threat.” 

“To me, as somebody who has spent 
a lot of time in China, it comes across as 
incredibly naive—and if not naive, then 
willfully blind to some of the approaches 
being taken,” Phelan says. Singapore and 
Hong Kong may be better examples to 
follow, Konyndyk says: “There has been a 
similar degree of rigor and discipline but 
applied in a much less draconian manner.’ 
Jennifer Nuzzo of the Johns Hopkins Uni- 
versity Bloomberg School of Public Health 
also wonders what effects China’s strategy 
had on, for instance, the treatment of can- 
cer or HIV patients, whose care may have 
been interrupted. “I think it’s important 
when evaluating the impact of these ap- 
proaches to consider secondary, tertiary 
consequences,” she says. 

And the benefit may be _ short-lived. 
“There’s no question they suppressed the 
outbreak,” says Mike Osterholm, head of the 
Center for Infectious Disease Research and 
Policy at the University of Minnesota, Twin 
Cities. Reducing the peak number of cases 
buys a health system time to deal with later 
ones, public health experts say. But once the 
restrictions are lifted, “It'll come roaring 
right back,’ Osterholm predicts. 

Aylward and the other visiting scientists 
on the team were well aware of the “reality 
of different political systems,’ he says, but 
they spoke with hundreds of people around 
the country and “everyone agreed with the 
approach.” He hopes China’s successes so 
far will encourage other countries to act 
quickly. “We’re getting new reports daily 
of new outbreaks in new areas, and people 
have a sense of, ‘Oh, we can’t do anything,” 
Aylward says. “Well, sorry. There are really 
practical things you can do to be ready to be 
able to respond to this, and that’s where the 
focus will need to be.” 
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ATMOSPHERIC SCIENCE 


Why weather 
systems are 
apt to stall 


A new theory tries to explain 
enigmatic “blocks” that bring 
heat waves and drought 


By Paul Voosen 


t was the summer of 2003 in Europe, 

and, for a while, it seemed as if Earth’s 

weather system had broken down. For 

weeks a huge mass of air stalled over 

the continent, slowly subsiding and 

suppressing cloud formation, leaving 
day after day of brilliantly clear skies. The 
mercury rose, and a record-breaking heat 
wave gripped countries including France 
and Germany, causing 70,000 deaths. Then, 
as abruptly as it set in, the persistent at- 
mospheric block eased, and high winds 
brought relief. 

Few weather phenomena are as widely 
experienced—but poorly understood—as an 
atmospheric block. When a block arises, typi- 
cally at the western edge of a continent, the 
jet stream splits, trapping a blob of seemingly 
static air thousands of kilometers across. 
Such blocks can last for weeks, and drive 
heat waves, drought, and winter cold snaps. 
At the same time, the persistent flows around 
the edges of a block can route storm after 
storm to the same spot. A block “has very dif- 
ferent impacts in different seasons,’ says Tim 
Woollings, an atmospheric dynamicist at the 
University of Oxford. “But it’s always quite ex- 
treme.” Yet atmospheric scientists have long 
struggled to understand why blocks occur 
and last so long, and how they might change 
in a warming world. 

Several new theories are offering an- 
swers. A leading idea links blocking to the 
behavior at high latitudes of the Coriolis 
force, an effect of Earth’s rotation that can 
cause the jet stream to meander and con- 
tort. The theory, developed by Harvard Uni- 
versity atmospheric scientist Lei Wang, is 
unlikely to be the full picture, but it has a 
sobering implication. As the world warms, 
the jet stream is likely to move to higher 
latitudes, which could lead to even more 
blocking events. 

The new ideas about blocking emerged 
from debate over another potential impact of 
climate change. Researchers led by Jennifer 
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Francis, a climate scientist at the Woods Hole 
Research Center, have suggested a warm- 
ing Arctic is leading to more severe winter 
storms in the midlatitudes (Science, 18 April 
2014, p. 250). Francis has argued that the en- 
croaching warmth is slowing the jet stream 
and causing it to form larger blocklike mean- 
ders, exacerbating winter weather. But other 
scientists are skeptical of the mechanism. 
The controversy has exposed just how much 
there is to learn about blocking, Wang says. 
“The lack of theory is really the root cause of 
a lot of confusion we now have.” 

Scientists did have a standard explanation 
for what sustains blocks. Published in 1983, 
the idea stems from close study of a single 
block in 1976 that led to drought in Europe. 
Once the block was established, the split jet 
stream seemed to shear later weather systems 
apart. The energy of these atmospheric ed- 
dies then fed into the slowly rotating trapped 
airmass, creating a feedback loop that re- 
inforced and sustained the block. It was an 
elegant idea, says Stephan Pfahl, an atmo- 
spheric dynamicist at the Free University of 
Berlin, but “its connection to more realistic 
cases has always been a question mark.” 

Wang revisited this work. First, he simu- 
lated blocking events in a series of simple at- 
mospheric models, hoping to detect the eddy 
feedback, as it’s known. He was shocked—it 
didn’t exist. Looking at the original calcula- 
tions from the 1980s, Wang saw an error: 
They generated the feedback even when the 
block was missing. He then used modern 
weather records to examine some 100 simi- 
lar blocking events over several decades. He 
found that eddies played a minimal role, at 
most, in maintaining them, as he reported 
this year at the annual meeting of the Ameri- 
can Meteorological Society. All signs pointed 
to one conclusion, he says. “The existence of 
the feedback that was discovered in that pa- 
per is a false alarm.” 

Two years ago, Wang’s former adviser, 
atmospheric scientist Noboru Nakamura 
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at the University of Chicago, proposed an 
alternative explanation for blocks. Over the 
years, he had painstakingly developed an 
equation to track large meanders in the jet 
stream. Such bends can eventually break, 
causing the jet to split. Nakamura found 
that these meanders seemed to behave like 
cars on a highway; pile enough together on 
the jet, and a traffic jam—a block—arises, he 
argued 2 years ago in Science. Atmospheric 
scientists are still digesting Nakamura’s 
proposal. But Wang and others suspect it 
may be more complicated than needed. 
Wang and his co-author, Zhiming Kuang, 
also at Harvard, have come up with their 


No change in the weather 

Ina “dipole” atmospheric block, the jet stream 
splits, wrapping around slowly rotating high- and 
low-pressure air masses and trapping them. 


own theory. While a student at the Univer- 
sity of Chicago, Wang had pulled off the 
lunchroom shelf the dissertation of Tu-cheng 
Yeh, who studied in Chicago with pioneering 
geophysicist Carl-Gustaf Rossby in the 1940s. 
Yeh, who became a famed meteorologist 
in China, had noted that blocking events are 
more common and last longer when the jet 
stream moves to higher latitudes. 

That pattern, Yeh proposed, is set by the 
atmosphere’s inherent dispersiveness— 
essentially, the noise and chaos in the sys- 
tem. At low latitudes, the atmosphere is 
too noisy to sustain a long-term pattern. 
For example, giant undulations in airflows 
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In 2018, an atmospheric block caused drought that 
dried out the Rhine River in Germany. 


like the jet stream, called Rossby waves, are 
quickly canceled out by other waves of dif- 
ferent amplitudes and frequencies. But at 
higher latitudes, where the globe gets nar- 
rower, the waves generated by the Coriolis 
force are squeezed into a smaller band. That 
squeeze makes them more likely to align, 
which can amplify them, forcing the jet to 
split and give rise to a block. 

Yeh had demonstrated his idea only in a 
simple, 1D model. But now, Wang and Kuang 
have found it holds in more complex simu- 
lations. The work is still under review, with 
one part posted on the preprint server arXiv. 
“But it’s potentially a breakthrough,’ says 
Jian Lu, a climate dynamicist at the Pacific 
Northwest National Laboratory. “Fundamen- 
tally, blocking may be a simpler phenom- 
enon than we previously thought.” In the 
past, simple explanations couldn’t explain 
why long-lasting blocks are so frequent. In 
Wang’s model, the latitude dependence does 
just that, says Eric DeWeaver, a program 
manager for climate and large-scale dynam- 
ics at the National Science Foundation. “It’s 
very appealing as a back to basics rethink.” 

Wang's theory shares a flaw with its pre- 
decessors, however: They all treat the atmo- 
sphere as dry. Over the past 5 years, Pfahl 
and others have shown that moisture is cru- 
cial to sustaining blocks. Indeed, Pfahl found 
in case studies and models that many blocks 
persist only if fed air in which water vapor 
has condensed into clouds and rain, releas- 
ing heat that uplifted the air mass. “For me 
this is the really new thing,” Woollings says. 

The next step will be to see whether 
the new insights can improve forecasts 
of weather and climate. Current models 
project a small decline in the frequency of 
blocks as the world warms, but scientists 
have low confidence in that prediction, says 
Olivia Martius, an atmospheric dynami- 
cist at the University of Bern. “There is no 
simple answer” for how to improve such 
predictions. But the new theories could 
identify thresholds—specific atmospheric 
conditions—at which blocks are likely to 
proliferate, Nakamura says. “If you can nail 
the threshold condition, then you can ask 
how climate changes that threshold.” 

Wang’s theory suggests one answer. 
Warming is expected to push the polar jet 
stream in the Northern Hemisphere far- 
ther north. And Wang has found that shift- 
ing the stream 10° closer to the pole could 
bring a 10-fold increase in blocks—and 
the heat waves and droughts they foster. 
“Northern Europe would experience many 
more,” Lu says. “Especially Russia. It’s a 
huge impact.” & 
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ANIMAL HEALTH 


Unplanned experiment could 
help save a key farmed fish 


Random outbreak at research center points to tilapia 
variants that resist the deadly emerging tilapia lake virus 


By Erik Stokstad 


bout a decade ago, farm-raised tilapia 

in Israel began to die mysteriously. 

The fish had ulcerated skin and inter- 

nal hemorrhages; sometimes ponds 

full of fish were wiped out. In 2014, 

researchers identified the culprit: a 
previously unknown virus they named tila- 
pia lake virus. Since then, the virus has been 
detected on farms in Asia, Africa, and the 
Americas. There’s no cure and no vaccine, 
and the virus is likely spreading, threaten- 
ing one the world’s most important farmed 
fish. “It’s a major global problem,’ says John 
Benzie, a geneticist at WorldFish, an interna- 
tional publicly funded research center. 

New findings, however, are providing 
hope that Nile tilapia (Oreochromis niloti- 
cus), the most common kind of farmed ti- 
lapia, could be bred to resist the virus. In a 
lucky break, in 2018 the virus struck a pond 
at WorldFish holding numerous tilapia pro- 
duced for breeding experiments, and some 
of the fish proved completely resistant to 
the virus, Benzie and colleagues reported 
last month in Aquaculture. “It’s good news 
for the tilapia sector,’ says Morten Rye, 
a fish geneticist at Benchmark Genetics, 
a company that breeds tilapia and other 
aquacultural species. 

Tilapia is the second most popular fish 
in aquaculture, after carp, with farmers in 
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more than 120 countries now harvesting 
about 6 million tons per year. It’s especially 
important in developing nations, many of 
which rely on a productive strain first de- 
veloped in the 1990s at WorldFish. 

The emerging virus affects several types 
of farmed tilapia, and was probably caus- 
ing problems for at least several years be- 
fore its discovery. Although researchers 
know some regions have been hit hard, the 
overall distribution and impact of the virus 
are not clear. Because the World Organisa- 
tion for Animal Health has not certified a 


A threatened bounty 
Production of pond-raised Nile tilapia has been 
booming in Asia and Africa, helping the species 
become the second most popular farmed finfish 
in the world, behind carp. 
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Farm-raised tilapia at a market in Egypt, where 
growers have had outbreaks of a damaging fish virus. 


diagnostic test that works in all situations, 
countries are only required to report major 
outbreaks. More than a dozen countries 
do voluntarily report additional data, but, 
“We really need descriptive and analytical 
epidemiological studies to help us under- 
stand the situation in the field,” says Mona 
Dverdal Jansen, a veterinary epidemiologist 
at the Norwegian Veterinary Institute. 

In the meantime, researchers at World- 
Fish are working with the University of 
Edinburgh’s Roslin Institute to breed Nile 
tilapia that resist the virus. The 2018 out- 
break in WorldFish’s pond in Malaysia 
is helping them identify promising ge- 
netic variants. In all, the pond contained 
1821 individually tagged tilapia belonging 
to 124 groups of siblings. Each group had 
different pairs of parents. Nearly 40% of all 
the fish died; that was enough to allow the 
researchers to pick out which groups of sib- 
lings had the highest survival rates. 

Just as encouraging, the WorldFish out- 
break enabled researchers to show that 
about 50% of the variation in survival was 
due to genetics. “That bodes well for fu- 
ture breeding to improve resistance to the 
virus,’ says co-author Ross Houston, a fish 
geneticist at the Roslin Institute. The re- 
searchers also found that bigger—and more 
valuable—fish were just as likely to resist 
the disease as the smaller fish. That find- 
ing suggests breeders won’t have to sacrifice 
yield to boost resistance, which would be a 
problem for growers. 

Still, getting resistant tilapia into the 
world’s ponds could take years. Research- 
ers need a genomic test that would al- 
low breeders to quickly identify fish with 
desirable genes, and they need a reliable 
and realistic way to infect the fish to find 
out whether they really are resistant. Rye 
cautions that even if breeders develop a 
winning strain, mass-producing the fish 
in hatcheries and distributing them— 
especially in the developing world—will be 
a tall order. “It’s not changing everything 
overnight,” he says. 

Some companies are pursuing a differ- 
ent approach: developing a vaccine against 
tilapia lake virus. But viable candidates are 
still far off, and even a highly effective vac- 
cine might not be cheap enough to be eco- 
nomically viable, especially because tilapia 
is a relatively low-value fish often grown by 
poor farmers. 

Given such issues—and the threat that 
the virus is continuing to spread—creating 
hardier tilapia breeds has become “a matter 
of urgency,’ Benzie says. “We’re going hell 
for leather on this.” & 
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U.S. RESEARCH FUNDING 


Pentagon’s social science research faces threat 


Minerva initiative falls out of favor in latest DOD review of spending priorities 


By Jeffrey Mervis 


he U.S. Department of Defense (DOD) 

wants to kill a unique social science 

research program meant to help it 

understand nontraditional threats 

to national security, from the rapid 

growth of China’s technological prow- 
ess to radical Islamic terrorism. 

The $20 million Minerva Research Initia- 
tive is in jeopardy after Defense Secretary 
Mark Esper ordered a review that identified 
some $5 billion worth of programs deemed 
less essential to DOD’s mission. Michael 
Griffin, DOD undersecretary for research 
and engineering, has made developing hy- 
personic weapons a top priority, and DOD 
watchers say Minerva doesn’t meet 
Griffin’s definition of what his of- 
fice, which controls a $16.1 billion 
research budget, should fund. In its 
2021 budget request, DOD proposed 
cuts that could lead to the program’s 
demise, but Congress could save it. 

“The current undersecretary 
doesn’t believe that Minerva is sci- 
ence,” says a Democratic congressio- 
nal staffer. “He thinks it’s soft, and 
his priorities are elsewhere.” 

The Minerva program began un- 
der former President George W. 
Bush. “Too many mistakes have 
been made over the years because 
our government and military did 
not understand—or even seek to 
understand—the countries or cultures 
we were dealing with,” then-Defense 
Secretary Robert Gates said in an April 2008 
speech announcing the program. 

Gates envisioned a role for researchers in 
promoting “soft power—the elements of na- 
tional power beyond guns and steel.” He also 
hoped that Minerva would improve what 
he called the often “hostile” relationship be- 
tween DOD and social scientists. 

A 2019 evaluation by the U.S. National 
Academies of Sciences, Engineering, and 
Medicine found that Minerva has met 
both those goals over the past decade. Its 
111 grants—which now average $1.5 million 
over 3 years—“have produced a substantial 
body of research in a variety of areas of im- 
portance to national security ... [and have] 
had a positive impact on the amount of dia- 
logue between DOD and the social science 
community.” But the findings from Minerva 
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projects, it added, have been slow to reach the 
intended audience of policymakers within 
the Pentagon and commanders in the field. 
Although Minerva projects represent a 
tiny fraction of DOD’s $2.6 billion invest- 
ment in basic research, they have gotten out- 
size media attention. One example is work 
conducted by researchers including cultural 
anthropologist Scott Atran of the University 
of Michigan, Ann Arbor, and the University 
of Oxford. It probes the psychology of sui- 
cide bombers, for example, by using brain 
scans to track how people react to perceived 
threats to their deeply held values. 
Economist Eli Berman of the Univer- 
sity of California, San Diego, has worked 
on Minerva projects that embedded social 
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Afghan security forces trained by the U.S. military inspect travelers 
at a checkpoint near the Pakistani border. 


scientists with U.S. military leaders in Af- 
ghanistan to seek ways to strengthen local 
institutions and combat insurgency move- 
ments. “The goal was always to be useful 
right away,’ Berman says. 

As social scientists, Minerva grantees 
focus on promoting understanding and 
preventing conflicts, often through inter- 
disciplinary teams. For example, one proj- 
ect sent anthropologists to do ethnographic 
studies of Indonesian communities, allowed 
sociologists to conduct attitudinal surveys 
in West Africa, and funded computer scien- 
tists to analyze online data related to social 
movements in several regions of the world. 
The result was a web-based tool, called 
LookingGlass, that allows authorities to 
track emerging threats by displaying infor- 
mation about these movements in real time. 
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That approach contrasts with the Penta- 
gon’s traditional emphasis on tapping basic 
research to improve its chances of winning 
battles. “Minerva is not going to support 
the science needed to develop the tools for 
projects that the undersecretary considers a 
priority, like hypersonic weapons,’ says Erin 
Fitzgerald, who directed Minerva for 5 years 
and is now a research administrator at the 
University of Maryland, College Park. “But 
it might help us avoid the need to deploy 
those weapons.” As Atran puts it: “If supe- 
rior machines and hardware were the way 
to win wars, we'd have been out of Afghani- 
stan in 2002 and Iraq in 2003. But it’s peo- 
ple. And we still don’t understand them.” 

Griffin declined comment, but a DOD 
spokesperson says Esper’s review al- 
lowed Griffin’s office to “scrutinize 
and revector our fiscal year 2021 bud- 
get request to align more directly with 
the department’s modernization pri- 
orities.” Asked specifically about the 
value of Minerva, the spokesperson 
said, “The question was not, ‘Is this 
a good effort?’ but rather, ‘Is a dollar 
spent on this effort more important to 
our military capability than spending 
that same dollar on” a priority area? 

DOD’s proposed budget would 
put Minerva on a path to a rapid 
demise. It would eliminate the 
$11 million contribution from the 
basic research office, leaving Mi- 
nerva with some $4 million that 
Congress has recently earmarked 
for research on how other govern- 
ment are using social media to undermine 
U.S. institutions and sway public opinion 
and to examine the threat posed by China, 
Russia, North Korea, and Iran. 

The Navy and Air Force could continue to 
fund projects they think are useful, although 
observers say they are likely to bow out once 
money from the basic research office disap- 
pears. (The Navy has proposed spending 
$3 million in FY2021; the Air Force doesn’t 
break out Minerva but has invested roughly 
$2 million annually in recent years.) 

Minerva backers are hoping its track 
record will persuade Congress to protect 
the program. “Our biggest concern is to 
make sure that the research doesn’t stop,” 
the Democratic staffer says. “We think that 
Minerva is so important in helping DOD 
achieve its mission.” & 
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ASTRONOMY 


New telescope promises boon for Turkish science 


The 4-meter Eastern Anatolia Observatory aims to become an affordable regional hub 


By Umar Farooq 


nternational megaprojects that cost well 

over $1 billion generate most of the ex- 

citement in observational astronomy 

today: the 39-meter Extremely Large 

Telescope under construction in Chile, 

for example, or the Thirty Meter Tele- 
scope, controversial because of its proposed 
location on Mauna Kea, a mountain sacred 
to some Native Hawaiians. 

But smaller telescopes still do cutting- 
edge science as well. And Turkish scientists 
are eagerly awaiting the completion of the 
new Eastern Anatolia Observatory (DAG), a 
4-meter optical and infrared telescope ex- 
pected to come online next year. Its main 
structure is scheduled to be shipped 
to the site, a 3170-meter mountaintop 
in northeastern Turkey, this month; 
polishing and grinding of the primary 
mirror is nearly done. 

Despite its modest $34 million 
price tag, DAG will be one of Asia’s 
largest optical telescopes and Tur- 
key’s largest science project. Its de- 
velopers hope DAG will make Turkey 
a regional astronomy hub and help 
nurture its astronomy community. 
“We are dreaming of using the instru- 
ments on DAG,” says Ozgur Basturk, 
an exoplanet scientist at Ankara Uni- 
versity. “With a 4-meter telescope, we 
can do much better and be more com- 
petitive in our field.” 


world’s largest telescopes. The European 
Southern Observatory (ESO) in Chile, for in- 
stance, prioritizes observing time for projects 
led by scientists from the 16 nations that have 
helped pay for the facility and contribute to 
its €162 million annual budget. “In principle 
anyone can apply to use these telescopes, but 
the competition is very high, so the research 
quality [of your proposals] should be high,’ 
says astronomer Sinan Alis of Istanbul Uni- 
versity. “You have to somehow reach that 
level to be able to compete.” In 2010, Turk- 
ish astronomers began to lobby their govern- 
ment to apply for ESO membership, but they 
gave up a few years ago, their hopes dashed 
by Turkey’s economic crisis. (Another devel- 
oping country, Brazil, dropped its member- 
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Anew telescope, under construction on a 3170-meter mountaintop 


Turkey’s largest telescope today is a 


in eastern Turkey, is expected to come online next year. 


1.5-meter reflector near the city of An- 
talya on the Mediterranean coast. The new 
facility will sit well above haze and humidity 
and far from urban light pollution, and enjoy 
288 clear nights per year on average. Its de- 
sign and construction, financed by the Turk- 
ish government, has involved scientists at 
40 universities in the region, along with Eu- 
ropean academic and industrial partners. 
“Tt’s not a 40-meter telescope of course,” 
says Laurent Jolissaint of the University of 
Applied Sciences and Arts of Western Swit- 
zerland, the project’s lead optical engineer, 
“put we put everything into it. The optics 
system is the same essentially as the very 
newest systems.” DAG will be run by the 
Astrophysics Research and Application Cen- 
ter at Atatiirk University in Erzurum, a city in 
eastern Turkey. 

Turkish scientists have little access to the 
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ship bid last year because it could not afford 
€270 million in fees over the next decade.) 

Unable to make observations themselves, 
many Turkish astronomers fall back on archi- 
val data from large telescopes that have al- 
ready been picked over by other astronomers, 
who can take years to release them. Some 
Turkish astronomers have struck up collabo- 
rations with teams from ESO member coun- 
tries, but even then, getting travel funded is 
often difficult. 

That’s why the new telescope will be such 
a boon, Basturk says—for instance, for his 
exoplanet research. DAG will allow him 
to use one of the standard strategies for 
hunting exoplanets: monitoring stars for 
a wobbling motion that results from the 
gravitational tug of an orbiting planet. But 
like other premier telescopes, DAG will also 
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have a coronagraph, an instrument that 
masks bright stars to remove their glare so 
that dim planets orbiting them can be im- 
aged directly. New technology will enable 
the coronagraph to block multiple stars at 
the same time, needed to hunt for exoplan- 
ets in star systems that, unlike our own, 
have two stars or more. 

DAGQG’s large aperture will enable it to 
capture light from distant, faint objects like 
high redshift galaxies, allowing astronomers 
to probe far back into the universe’s history. 
That’s important for Alis, who hopes to use 
DAG to follow up on data from the Spektr- 
RG space observatory, a Russian-German sat- 
ellite launched last year to map the x-ray sky 
(Science, 14 June 2019, p. 1020). Many objects 
can produce x-rays, from black holes 
at the centers of galaxies to smaller 
black holes or neutron stars consum- 
ing mass from a companion star. “We 
need to follow up with optical and in- 
frared to see what they actually are,” 
Alis said. 

In an ironic twist, DAG officials 
say astronomers from several Eu- 
ropean countries have shown in- 
terest in Turkey’s new observatory, 
despite their access to ESO’s mega- 
telescopes, drawn by the location— 
there are few others in the Northern 
Hemisphere at that longitude—and 
affordable observation time. “We 
envision DAG as an international 
observatory,’ says the project’s di- 
rector, Cahit Yesilyaprak of Atatiirk 
University. “We do not have a prefer- 
ence about the origin of the proposals, as 
long as the best science cases are proposed 
and conducted.” 

But the biggest impact will be in Tur- 
key itself. Designing the telescope has al- 
ready pushed science in other fields. For 
instance, dozens of optical, mechanical, 
and computer engineers helped design the 
telescope’s adaptive optics system, which 
will sharpen its images, and its derotator, 
a device that keeps the telescope trained on 
the same spot as the Earth rotates. “These 
are big deals for a developing country,’ 
Yesilyaprak says. And the telescope itself 
will invigorate Turkish astronomy, Alis says. 
“Once we have the telescope, the expertise 
will pop up.” 


Umar Farooq is a journalist based in Istanbul. 


sciencemag.org SCIENCE 


PHOTO: ATATURK UNIVERSITY ASTROPHYSICS RESEARCH AND APPLICATION CENTER/ANADOLU IMAGES, 


PHOTO: SPL/SCIENCE SOURCE 


THE BODY'S 


ce 


nae 


Lee 


DANGEROUS 
DEFENDERS 


Neutrophils fiercely attack infections, 
but can also trigger disease. Researchers 
are trying to tame them sy Mitch Leslie 


eeking a new treatment for peo- 
ple who have dangerous block- 
ages in their coronary arteries, 
doctors in London are trying to 
disarm the body’s own defend- 
ers. The 90 patients in the study 
receive the usual treatments for 
heart disease, such as small tubes 
called stents to prop open their 
narrowed arteries. But half of the patients 
in the phase II trial, conducted by clinical 
pharmacologist Albert Ferro of King’s Col- 
lege London and colleagues, also pop pills 
targeting a class of immune cells called 
neutrophils. Researchers think that by in- 
vading the fatty obstructions, or plaques, 
in clogged arteries, neutrophils make them 
even more dangerous. The drug is designed 
to steer the cells away. 
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Interfering with neutrophils is an auda- 
cious idea. Neutrophils make up some 70% 
of the white cells in blood, with billions 
spawned every day by stem cells nestled in 
the bone marrow. The cells patrol the blood- 
stream and wage war on pathogens, and 
their defensive role is so vital that people 
who lose their neutrophils in the course of 
cancer treatment can die from infections. 

But their scorched-earth tactics are tough 
on the body. “They are the classic double- 
edged sword,” says immunologist Michael 
Fessler of the U.S. National Institute of En- 
vironmental Health Sciences. For example, 
the enzyme neutrophil elastase, which neu- 
trophils release to kill invading bacteria, also 
erodes the extracellular matrix, the mesh of 
proteins and sugars that cradles cells and 
provides structural support for tissue. 
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Aneutrophil (orange) traps bacteria by extruding a 
net of DNA—a protective response that can backfire. 


Neutrophils can also malfunction. In 
some conditions, such as chronic obstruc- 
tive pulmonary disease (COPD), they seem 
to lose their sense of direction: Instead of 
homing in on bacteria, they wander. Re- 
searchers suspect these vagrant neutrophils 
release neutrophil elastase and other mol- 
ecules at the wrong spots, including the 
lungs, causing the tissue damage seen in 
COPD. And as people age, neutrophils be- 
come less effective at combating infections 
and more harmful to the body. 

“We are now beginning to understand 
that neutrophils are linked to many diseases 
that blight our population,” says pulmono- 
logist Elizabeth Sapey of the University of 
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Birmingham in the United Kingdom. With 
that realization has come optimism that 
these cells aren’t off-limits for medical in- 
terventions. Researchers have been cau- 
tious about targeting neutrophils, fearing 
that doing so would leave patients at the 
mercy of pathogens. “The age-old idea was 
that you couldn’t touch neutrophils,” Sapey 
says. But a series of clinical trials has of- 
fered reassurance that it can be safe to re- 
strain the cells. 

Although some of those trials yielded 
mediocre results, researchers have pressed 
on—and their persistence may be paying 
off. Two clinical trials with neutrophil- 
focused drug candidates reported encour- 
aging findings last year; one could lead to 
the approval of the first medication tar- 
geting these immune cells, in a rare auto- 
immune condition. The trial Ferro and his 
colleagues are conducting, the first to at- 
tempt to stem cardiovascular disease by 
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Blunting the attack 


interfering with neutrophils, is expected to 
have results by 2021 or 2022. And a burst of 
new findings about the cells’ diversity and 
behavior is likely to inspire new ways to ma- 
nipulate them. “There is real hope in this 
field,’ Sapey says. 


NEUTROPHILS FLOCK to the site of an injury 
or infection, wherever it occurs. The cells 
navigate to trouble spots with the help 
of receptors on their surface, such as one 
known as CXCR2, which detects a trail of 
alarm molecules released by damaged tis- 
sue. If the cells happen upon microbes, 
they launch a multipronged attack, gob- 
bling the pathogens, spilling corrosive 
chemicals, and sometimes launching webs 
of DNA, known as neutrophil extracellular 
traps (NETs), that snag and kill the invad- 
ers. Hematopoietic stem cell transplants 
given to some cancer patients to replace 
their bone marrow underscore the im- 


Released by neutrophils, vesicles (yellow) carrying 


protein-degrading enzymes can bind to and damage 


collagen fibrils (blue) in connective tissue. 


Ongoing clinical trials aim to treat a range of diseases by targeting proteins that abet tissue damage by neutrophils. 
Earlier tests had reassured researchers that the strategy would not leave patients defenseless against infections. 


CONDITION TARGET DRUG STATUS DRUG MANUFACTURER 

ANCA vasculitis C5a (molecule that helps switch Avacopan  Phaselllcompleted ChemoCentryx 
on neutrophils) 

Chronic obstructive CXCR2 (receptor that helps guide Danirixin Phase ll completed — GlaxoSmithKline 

pulmonary disease neutrophils to damaged tissue) 

Cardiovascular disease CXCR2 AZD5069 ~=—- Phase llunderway = AstraZeneca 

Asthma PI3K (protein that helps Nemiralisib Phase llcompleted GlaxoSmithKline 
control neutrophil migration) 

Cystic fibrosis Neutrophil elastase POL6014 ~—- Phasel/Ilunderway Santhera 
(tissue-damaging enzyme) Pharmaceuticals 

Pulmonary arterial Neutrophil elastase Elafin Phase | underway Proteo Biotech 


hypertension 
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portance of these defenses. “If the graft 
doesn’t take, you will have no neutrophils, 
and that is not survivable,” says immuno- 
logist Klaus Ley of the La Jolla Institute 
for Immunology. 

Yet studies over the past 2 decades have 
darkened their image. The cells in effect 
“drop atomic bombs” in the body, says 
immunologist Paul Kubes of the Univer- 
sity of Calgary. Neutrophil elastase and 
other chemicals they discharge may drive 
inflammation and damage to the airways 
in cystic fibrosis and intractable cases of 
asthma. NETs, for their part, can provoke 
the immune system to attack a patient’s 
own cells in the autoimmune disease lupus, 
and they spur formation of blood clots in 
the potentially lethal condition deep-vein 
thrombosis. NETs can also worsen arterial 
plaques by promoting inflammation, cell 
biologist Venizelos Papayannopoulos of 
the Francis Crick Institute in London and 
colleagues found in 2015. Neutrophils can 
even abet cancer, spurring growth of new 
blood vessels that feed tumors and helping 
the abnormal cells spread to other parts of 
the body. 

More than 10 years ago, such disease 
links prompted some companies to begin to 
develop compounds that inhibit neutrophil 
elastase. Others tried to restore neutrophils’ 
navigational abilities, which are faulty in 
COPD and other conditions, by blocking the 
protein PI3K, an enzyme involved in con- 
trolling cell movement. 

More than a dozen clinical trials scru- 
tinized these potential drugs in many con- 
ditions. The good news is that the com- 
pounds didn’t cripple defenses against 
infections. But most of the trials found min- 
imal benefits. As a result, several big phar- 
maceutical companies have given up. In 
2019, GlaxoSmithKline jettisoned danirixin 
and nemiralisib, the two neutrophil- 
targeting drug candidates it had been 
developing for lung diseases. Merck and 
AstraZeneca have also recently abandoned 
once-promising compounds. 

But researchers say there’s still hope, ar- 
guing they haven’t yet pinned down what 
doses to use, how best to deliver potential 
drugs, and which aspects of neutrophil 
biology to target. In trials for COPD and 
other lung disorders, Sapey notes, patients 
inhaled the compounds. But because most 
neutrophils ply the bloodstream, inhaled 
drugs may not reach faulty cells, she says. 
Nervous about side effects, investigators 
may also have kept doses too low. 

Hematologist and oncologist Steven 
Pavletic of the U.S. National Cancer Insti- 
tute and his colleagues think they can do 
better. In a phase I safety trial, Pavletic and 
his team have started to give larger amounts 
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of the neutrophil elastase inhibitor alve- 
lestat to patients with the lung condition 
bronchiolitis obliterans. The ailment strikes 
a significant percentage of people who re- 
ceive hematopoietic stem cell transplants 
during cancer treatment, developing when 
immune cells from the transplant attack the 
recipient’s own tissue. Neutrophils swarm 
into the lungs and may encourage scar tis- 
sue to build up in the bronchioles, the lungs’ 
smallest airways, which can cause them to 
clog. For transplant recipients, bronchiol- 
itis obliterans is like having an unstoppable 
asthma attack, Pavletic says. 

He adds that alvelestat probably won’t re- 
verse bronchiolitis obliterans, but by block- 
ing neutrophil elastase, it could stop the 
disease from getting worse. “Even a mar- 
ginal improvement could lead to a major 
improvement in lung function.” 

Researchers are further along in evaluat- 
ing a way to target neutrophils to treat a 
rare autoimmune disease in 
which the immune system 
produces antibodies against 
its own neutrophils—a condi- 
tion known as antineutrophil 
cytoplasmic autoantibody 
(ANCA) vasculitis. The anti- 
bodies glom onto the cells, 
which then get stuck in small 
blood vessels and release their 
chemical load, leading to in- 
flammation that can cause 
kidney damage and other 
problems. In the first year after diagnosis, 
patients with the disease are nine times 
more likely to die than people in the general 
population, and current treatments, which 
include steroids and immune-suppressing 
drugs, are one reason for this high mortal- 
ity, notes nephrologist David Jayne of the 
University of Cambridge. 

In a recent trial involving 331 ANCA 
vasculitis patients, Jayne and colleagues 
tested the experimental drug avacopan, 
which stymies C5a, a protein in the blood 
that helps spur neutrophils to release their 
inflammation-promoting contents. The re- 
sults of the phase III trial, announced late 
last year, revealed that the drug was about 
20% more likely to produce remissions af- 
ter 1 year than steroids were. It improved 
patients’ kidney function and was also less 
harmful than steroids. Jayne predicts ava- 
copan will be approved by the U.S. Food 
and Drug Administration and “will be a 
complete replacement for steroids” in treat- 
ment of ANCA vasculitis. 

Although many efforts to target neu- 
trophils aim to inhibit them, Sapey and 
her colleagues believe rejuvenating neu- 
trophils could help boost immune func- 
tion in older people. An existing drug, the 
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“Neutrophils 
are linked to 
many diseases 
that blight 


our population.” 


Elizabeth Sapey, 
University of Birmingham 


cholesterol-lowering medication simvas- 
tatin, may do the job, they found. Simvas- 
tatin had caught their attention because 
some evidence suggested people taking it 
for high cholesterol are less vulnerable to in- 
fections. When the team members exposed 
neutrophils to simvastatin in the lab, they 
found it boosts the cells’ migration accuracy 
and improves their performance in other 
ways. And in a phase II trial of the drug in 
62 older people with bacterial pneumonia 
and sepsis, Sapey and her colleagues deter- 
mined that adding simvastatin to the nor- 
mal treatment regimen reduced the severity 
of infections and allowed the patients to 
spend more time out of the hospital over 
the next year. 

“That was highly unexpected and very ex- 
citing,” says Sapey, whose team reported the 
findings last year in the American Journal 
of Respiratory and Critical Care Medicine. 
They are now planning a larger study of the 
drug, and she says it might 
also work against other com- 
mon bacterial problems in 
the elderly, such as urinary 
tract and skin infections. 


THE THERAPEUTIC possibili- 
ties are likely to broaden as 
researchers learn more about 
the biology of neutrophils. 
The cells have traditionally 
been seen as the immune sys- 
tem’s cannon fodder—simple, 
expendable killers. But neutrophils, it turns 
out, aren’t so simple after all. “They are not 
merely the cells that clear microbes from in- 
fected sites,’ Papayannopoulos says. 

“The most exciting thing is that the 
cells are much more heterogeneous than 
we thought,’ says immunologist Leo 
Koenderman of University Medical Center 
Utrecht in the Netherlands. With tech- 
niques such as single-cell RNA sequencing, 
researchers have discovered that neutro- 
phils come in multiple varieties. And their 
properties can vary by time of day. In hu- 
mans, the cells are much more aggressive 
toward pathogens at night than during 
daylight hours, and they crank up different 
genes depending on the hour. 

This diversity enables the cells to specialize 
and take on more tasks than researchers ex- 
pected. Some neutrophils have moved up the 
chain of command and help control the activ- 
ity of other immune cells. For example, they 
can either spur T cells to attack pathogens 
or rein them in. The cells can also take on 
alternative roles when they enter damaged 
tissue. If they encounter microbes, they be- 
come fighters. But if an injury is not infected, 
they become healers. Chemicals they release 
can stimulate new blood vessels to form and 
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spark the production of replacement cells. 

Intravital microscopy, which reveals the 
movements of cells within the body, sug- 
gests the lives of these immune cells are 
eventful. Researchers long assumed that 
once neutrophils left the bloodstream and 
moved into infected or injured tissues, they 
would eventually self-destruct, allowing the 
inflammation at the site to clear. But sev- 
eral microscopy studies have revealed more 
complex migrations. In a 2011 study, for 
example, Anna Huttenlocher of the Univer- 
sity of Wisconsin, Madison, and colleagues 
tracked neutrophils in zebrafish that had 
wounds on their fins. The scientists found 
that individual neutrophils moved into and 
out of a wound several times before eventu- 
ally departing for good. 

Now, Ferro and his colleagues are trying 
to interrupt one leg of the neutrophils’ jour- 
ney. In patients with heart disease, the cells 
worm into the fatty plaques lining the coro- 
nary arteries. There, they release chemicals, 
including neutrophil elastase, that may 
make these plaques more likely to fracture 
and spawn blood clots, which can trigger 
heart attacks. 

The drug candidate that Ferro and col- 
leagues are testing in people with serious 
plaques, AZD5069, blocks CXCR2, the re- 
ceptor that helps neutrophils navigate to 
infected and inflamed locations. In animal 
tests, the compound deters neutrophils 
from entering sites of inflammation. 

Death rates from cardiovascular diseases 
have been dropping for decades in many 
countries thanks to lifestyle changes, such as 
a decline in smoking, as well as drugs that 
lower cholesterol and combat blood clot- 
ting. But heart disease still kills more than 
600,000 people in the United States every 
year. “There's clearly a big gap that remains 
in therapy for these patients,’ Ferro says. 
Within a couple of years, he hopes to know 
whether deflecting neutrophils can help. 

Some researchers remain skeptical that 
the current crop of experimental neutrophil- 
targeting drugs will work outside certain 
rare diseases. The cells have multiple re- 
dundant pathways that control their ac- 
tivity, Koenderman notes. A neutrophil “is 
extremely robust,” he says. The idea that a 
single drug could correct its misbehavior “is 
pretty naive,” he says. 

However, researchers are already de- 
veloping new approaches, including DNA- 
destroying enzymes that might slice up 
NETs and prevent blood clots. And as sci- 
entists dig deeper into the cells’ biology, 
they may find new ways to keep these im- 
mune soldiers in check, Fessler says. “There 
is hope that with increasing understanding 
of neutrophils, we will have more sophisti- 
cated approaches down the line.” 
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Understanding the mother- 
breastmilk-infant “triad” 


Breastmilk research holds important opportunities 
to improve maternal-child health 


By Lars Bode’, Arjun S. Raman’, Simon H. 
Murch’, Nigel C. Rollins*, Jeffrey 1. Gordon? 


reastfeeding and breastmilk ex- 
ert remarkable influence on infant 
survival and health (J, 2), includ- 
ing reduced risk from infections 
and promoting various aspects of 
postnatal development. The many 


maternal benefits include protection from 
breast and ovarian cancer and cardiometa- 
bolic disorders. Although the mechanisms 
underlying some of these benefits have 
been elucidated, the origins of others that 
have been reported, such as influence on 
adult IQ and later protection against obe- 
sity and diabetes, remain more obscure. 
Hence, timely investments in research de- 


signed to clarify the operations and biolog- 
ical effects of the mother-breastmilk-infant 
“triad,” and their translation into public 
health, are needed. 

Breastmilk does not stand alone; ma- 


ternal physiology, breastmilk composi- 
tion, and infant physiology are parts of 
a coadapting system, with variations in 
each influencing the trajectory of infant 
development and maternal health. In ad- 
dition to macronutrients and micronutri- 
ents essential for child survival, breastmilk 
contains other myriad bioactive compo- 
nents, including cells and microbes (3, 4). 
Breastmilk can be considered a “live tissue” 
whose composition varies between women 
and changes over the course of lactation. 
Structurally diverse human milk oligo- 
saccharides (HMOs) represent the third 
most abundant nonaqueous component of 
breastmilk (after lactose and lipids). One 
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A Rohingya Muslim refugee holds her 
malnourished child in Bangladesh. Better 
understanding of breastmilk could improve 
therapeutic foods to treat undernutrition. 


prominent example illustrating how ma- 
ternal genotype affects breastmilk compo- 
sition is a single-nucleotide polymorphism 
that introduces a premature stop codon in 
the fucosyltransferase-2 (FUT2) gene. This 
mutation abolishes the ability to synthe- 
size a(1-2)-fucosylated HMOs. The pres- 
ence or absence of these HMOs creates 
specific maternal “lactotypes,’ known as 
secretors and nonsecretors, respectively, 
with the breastmilk of secretors conveying 
reduced risk of common forms of infec- 
tious diarrhea (5). 

Comprehensive characterization of 
the components of each axis of the triad 
through longitudinal and cross-sectional 
studies of maternal-infant cohorts has 
expanded markedly. Increasingly, high- 
throughput analytical methods have been 
used to characterize more than 150 differ- 
ent HMO structures, and intra- and inter- 
personal variations in their representation 
within and across different populations 
(6). Other components of breastmilk, in- 
cluding compounds associated with 
the membrane that surrounds milk 
fat globules, microRNAs, and bacte- 
rial constituents, as well as antibodies 
and immune cells, are being actively 
cataloged and characterized. In ad- 
dition to quantifying the products of 
metabolism in infants and their moth- 
ers by mass spectrometry, platforms 
are now available for simultaneously 
measuring the concentration of thou- 
sands of proteins circulating in blood 
that are biomarkers and regulators of 
numerous physiologic, metabolic, and 
immune processes, as well as other 
facets of growth and homeostasis 
(7). Furthermore, recent studies have 
highlighted how, with the use of cul- 
ture-independent methods, features 
of gut microbial community develop- 
ment in infants and young children 
can be used as a readout for their nu- 
tritional status (8, 9). 

Although datasets pertaining to 
each axis are available, considerably 
more work is needed to quantitatively 
relate how environmental influences 
affect the triad and in turn, how vari- 
ations in each of its axes influence the 
other (see the figure). For example, it 
will be interesting to discover what 
other maternal genetic factors af- 
fect biosynthesis of HMOs and other 
milk constituents. The mechanisms 
that link maternal nutritional sta- 
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Breastmilk 
components 


tus and other aspects of their physiology 
to breastmilk features and infant growth 
phenotypes are also an important issue. 
Additionally, which signaling pathways 
allow infant health status to regulate ma- 
ternal biology, including breastmilk com- 
position, should be investigated. How 
varied sociocultural, behavioral, and envi- 
ronmental factors shape and perturb the 
development of the triad is important to 
understand so that a “normal” range can 
be defined. Two disorders, childhood mal- 
nutrition and necrotizing enterocolitis 
(NEC), illustrate how a deeper understand- 
ing of the mother-breastmilk-infant triad 
could improve child health with potential 
lifelong benefits, and how some of the ana- 
lytic challenges might be surmounted. 
Childhood malnutrition contributes to 
45% of deaths worldwide in those under 
the age of five; it manifests early in life and 
involves disruption of multiple biological 
systems fundamental to healthy growth, 
including host pathways influenced by 
the developing gut microbiota, which are 
key consumers of breastmilk constituents 
(8, 9). One approach for obtaining new 
insights about disease pathogenesis is to 
conduct longitudinal studies of healthy 


How does breastmilk affect 


maternal and infant health? 
Mechanistic insights hold the promise of providing more 
informative definitions of health status, better predictions 
of health outcomes, improved recommendations 

for preventing disease, and new therapeutic targets. 
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and malnourished children living in areas 
where disease burden is high, and to 
comprehensively characterize the plasma 
proteomes, metabolomes, and developing 
microbial communities of malnourished 
infants and their healthy counterparts, 
their mothers’ breastmilk composition, and 
the products of microbial HMO utilization. 

A strategy for defining functionally im- 
portant interactions between triad compo- 
nents is to borrow from studies conducted 
in disparate fields where statistical co- 
variation is used for “feature reduction.” 
Components that covary with each other 
are deemed important for defining the be- 
haviors or functions of dynamic complex 
systems. Applying this approach to the de- 
veloping gut microbiota of healthy mem- 
bers of a Bangladeshi birth cohort sampled 
monthly from 1 to 60 months of age dis- 
closed a network composed of 15 covary- 
ing bacterial taxa (9). The abundances of 
these taxa describe normal gut microbial 
community assembly in healthy members 
of birth cohorts residing in diverse geo- 
graphic locales and are useful for quan- 
tifying the degree of impaired microbial 
community development in children with 
moderate and severe acute malnutrition. 

From the limited evidence avail- 
able, microbiota immaturity asso- 
ciated with these conditions is not 
repaired with standard therapeutic 
foods. Affordable, culturally accept- 
able complementary foods have been 
identified that in combination repair 
the gut microbiota of Bangladeshi 
children with moderate acute mal- 
nutrition toward a state resembling 
that of age-matched, healthy grow- 
ing children. This is accompanied 
by increases in numerous blood 
plasma protein biomarkers and me- 
diators of growth, bone formation, 
neurodevelopment, metabolism, and 
immune function (8, 9). 

These findings support the idea 
that healthy growth is linked in part 
to healthy development of the gut 
microbiota. They also raise the ques- 
tion of what factors shape microbial 
community development during the 
period of exclusive breastmilk feed- 
ing, and as children transition to 
complementary foods during the 
weaning period. Members of the bac- 
terial genus Bifidobacterium, notably 
B. longum subsp. infantis, have suites 
of genes involved in the import and 
metabolism of HMOs. It is impor- 
tant that efforts be directed to defin- 
ing the representation of B. infantis 
and other HMO-consuming bacteria 
in healthy versus malnourished in- 
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fants and their mothers. This information, 
together with characterizing the repre- 
sentation of genes involved in HMO ac- 
quisition and degradation in different bac- 
terial strains cultured from these children, 
would allow an assessment of (i) whether 
and how the presence of these different 
organisms and their genome features cor- 
relate with maternal breastmilk composi- 
tion and (ii) the degree to which products 
of breastmilk metabolism correlate with 
host features. The answers, from analyses 
of human biospecimens as well as animal 
models colonized with consortia of human 
gut microbes representing different stages 
of community assembly (0, 11), could 
have important therapeutic implications. 
These include the development of new pro- 
biotic, HMO-based prebiotic and/or synbi- 
otic (prebiotic combined with probiotic) 
therapies (72). 

NEC provides a different type of oppor- 
tunity to characterize the mother-breast- 
milk-infant triad. One of the most com- 
mon and fatal gastrointestinal disorders 
in preterm infants, NEC develops within 
the first few weeks of delivery. It is char- 
acterized by destruction of the integrity 
of the intestinal wall, invasion of luminal 
bacteria, marked inflammation, and sep- 
sis. Maternal and infant physiology are 
immature after preterm delivery in terms 
of producing and digesting breastmilk. 
Moreover, the use of antibiotics and other 
medications and interventions, when both 
mother and infant face serious and often 
life-threatening crises, further disrupts the 
mother-breastmilk-infant triad, including 
initial colonization of the infant intestine. 
Although breastmilk composition is not 
fully adapted to the physiological needs of 
the premature infant, breastmilk feeding, 
compared to enteral feeding with special- 
ized breastmilk substitutes, reduces NEC 
incidence by 6- to 10-fold (13). The mecha- 
nisms underlying these protective effects 
remain largely uncharacterized. 

HMOs significantly improve survival and 
reduce pathology in a neonatal rat model 
of NEC, leading to the identification of the 
HMO, disialyllacto-N-tetraose (DSLNT), 
as a protective factor (14), likely through 
its direct interactions with gut epithelial 
and immune cells. A multicenter study of 
mothers and their very-low-birthweight 
infants found that infants who developed 
NEC received breastmilk containing less 
DSLNT than infants who did not develop 
NEC (15). Proof of a causal relationship 
requires a randomized controlled clini- 
cal trial, which raises several challenges, 
including the availability of DSLNT and 
ethical considerations if control groups of 
high-risk infants were to be treated with 
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formula alone. More generally, NEC illus- 
trates the need to comprehensively define 
states of “triad immaturity.” This would 
entail longitudinal studies of the set of fea- 
tures that define breastmilk given to pre- 
maturely born neonates who do and do not 
develop this devastating disease. It would 
also require a simultaneous effort to obtain 
comprehensive definitions of the biologi- 
cal characteristics of chronologically age- 
matched preterm infants with and without 
NEC, as well as of their mothers. 

Mothers face a “balancing-act” between 
various socioeconomic, cultural, and even 
marketing pressures to maintain or forego 
breastfeeding and their motivation to pro- 
vide their infants with what is best for 
their health and development. This balanc- 
ing act is perpetuated in part by confusion 
surrounding the respective attributes of 
breastmilk versus breastmilk substitutes, 
with consumer understanding being heav- 
ily influenced by commercial interests. 
Aspirational goals include new parameters 
for defining health status and deeper un- 
derstanding of how health outcomes are 
related to breastfeeding and breastmilk 
components. Within a risk-stratified con- 
tinuum of care, knowledge of the latter 
has potential therapeutic implications and 
opportunities, personalized to the circum- 
stances of an individual mother and her in- 
fant (7). Such efforts will not only provide 
new appreciation of the remarkable prop- 
erties of nature’s first food, but also serve 
to further develop analytic approaches 
that yield insights into the dynamic sys- 
tems that direct infant development. 
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IMMUNOLOGY 


Origins of 
peanut allergy- 
causing 
antibodies 


Analysis of gut-produced 
antibodies raises 
questions about how food 
allergy arises 


By Duane R. Wesemann! and 
Cathryn R. Nagler? 


ome people produce immunoglobulin 
E (IgE) antibodies to proteins in com- 
mon foods. As a result, these foods 
can trigger severe allergic inflamma- 
tion (anaphylaxis). There are several 
structurally and functionally distinct 
antibody isotypes (IgM, IgD, IgG, IgA, and 
IgE), and which isotype binds to a target 
molecule (antigen) influences what hap- 
pens next. For example, IgG that binds pea- 
nut proteins is harmless, but IgE bound to 
the same proteins can induce anaphylaxis 
and death. Therefore, how, where, and why 
allergen-reactive IgE is made are decades- 
old questions. Hoh et al. (J) found that gut 
tissue is a likely place for IgE development 
in peanut-allergic individuals. In addition, 
despite vast sequence possibilities, they 
found that many individuals share similar 
peanut-reactive IgE DNA sequences. This 
suggests that IgE antibodies in different 
individuals recognize peanut proteins in a 
similar manner, which could inform strate- 
gies for pharmacological interventions. 
Antibodies are produced by cells of 
the B lymphocyte lineage and consist of 
four Ig polypeptide chains—two identical 
heavy (H) chains and two identical light 
(L) chains—and each chain has a variable 
(V) region and a constant (C) region. The 
V region forms the surface that physically 
binds to antigens such as peanut proteins. 
The C region of IgH (CH) dictates antibody 
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function largely by triggering 
processes that lead to neutral- 
ization, elimination, or induc- 
tion of inflammation. Assembly 
of multiple JGH gene segments 
through the process of V(D)J 
recombination results in enor- 
mous sequence diversity, partic- 
ularly in the exon encoding the 
V region, which comprises much 
of the antigen binding surface. 
Next to the V exon are a vari- 
ety of C-region exons arranged 
in tandem (e.g., Cu, Cy, Ce, and 
Ca), which define the different 
antibody isotypes (2). 

Newly assembled Ig is pro- 
duced as IgM. However, during 
an immune response, the same 
V region of IgH (VH) can be ex- 
pressed in the context of another 
C-region isotype through a pro- 
cess called class switch recombi- 
nation (CSR). This positions an 
alternative CH region next to the 
VH exon by permanent excision 
of the intervening DNA and as- 
sociated CH-encoding exons. The 
position of CH exons relative to 
each other thus determines which 


Sources of peanut allergy 


Allergic reactions to peanut proteins 


are caused by immunoglobulin E (IgE) > 


antibodies. Bone marrow is a source ~ 
of IgE antibody for systemic distribution. 

Hoh et al. demonstrate that gut tissues 

are also a source of IgE. The degree to which 

gut IgE contributes to systemic allergy ¢ 
and bone marrow IgE contributes to gut 
sensitization is unknown. 
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IgE antibodies are composed of heavy 

and light chains, with a variable region 
that recognizes antigen and an IgE-specific 
constant region, which is derived through 
class switch recombination. 


IgH isotypes are available for sec- 

ondary switch events. For example, an IgM- 
expressing B cell can switch to IgGl, and 
then that same IgG1* B cell can switch to IgE 
because Ce is 3’ (downstream) to Cyl in the 
IGH locus. However, IgE-expressing B cells 
cannot switch to IgG1 (or any other isotype 
whose CH is positioned 5’ to Ce). Somatic 
hypermutation (SHM) further diversifies V 
regions by introducing mutations that can 
enhance affinity to antigen (3). 

The V(D)J recombination-, CSR-, and 
SHM-mediated processing of JGH enable 
phylogenetic mapping of how B-lineage cells 
and the antibodies they produce are related 
to one another. To gain insights into the ori- 
gins of IgE production and the relationships 
between antibody-producing cells, Hoh et 
al. sequenced the /GH genes from B-lineage 
plasma cells in upper digestive tract tissues 
of 19 peanut-allergic individuals and com- 
pared them with those of nonallergic con- 
trols. They found more IgE-expressing cells 
in gut tissue in food-allergic individuals, 
confirming previous findings (4). Multiple 
clonally related VH-encoding sequences in 
IgE antibodies were also shared with other 
IgH isotypes. These findings suggest that B 
cells undergo CSR to IgE in the gut tissue as 
opposed to undergoing CSR to IgE elsewhere 
before migrating to the gut. This is different 
to the case of bone marrow, which is a major 
destination for antibody-producing cells af- 
ter CSR elsewhere. This raises an important 
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question: What features of the gut environ- 
ment favor CSR to IgE? Moreover, because 
the bone marrow is a major location of an- 
tibody production, including IgE in allergic 
disease (5), the degree to which gut-derived 
versus bone marrow-derived IgE affects 
clinical disease, prognosis, and treatment 
approaches remains to be determined (see 
the figure). 

Hoh et al. identified antibody sequences 
that are reactive to the peanut protein Arah 
2 (Arachis hypogaea allergen 2) and found 
groups of similar sequences among mul- 
tiple individuals. Similar sequences were 
also found in analyses of IgE* B cells from 
peripheral blood in individuals with peanut 
allergy (6), further validating the concept of 
convergent IgE development to peanut pro- 
teins. Nonallergic individuals also had Ara 
h 2-reactive sequences, but only in non-IgE 
isotypes such as IgM, IgG, and IgA. 

These findings highlight how antibodies 
that induce a food-allergic response are gen- 
erated. The production of antibodies that 
bind peanut proteins does not seem to be 
the problem per se; instead, the switching 
of that antibody to the IgE isotype appears 
to be key. This is consistent with observa- 
tions that humans make IgG to a variety of 
dietary proteins without correlation to food 
allergy (7). In addition, it is possible that 
IgG to food antigens may be protective from 
food allergy by either blocking IgE binding 
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or otherwise interfering with IgE 
function (8, 9). Perhaps an inter- 
vention that discourages gut IgE 
CSR could prevent food allergy. 

The convergence of IgE se- 
quences in multiple peanut-al- 
lergic individuals suggests that 
immune recognition may occur 
through antibody binding to a 
few finite regions on key pro- 
teins. In this regard, drugs that 
block IgE binding to these re- 
gions holds promise as a therapy 

in allergic disease. There is proof 
of principle: Two  therapeu- 
tic monoclonal IgG antibodies 
against a cat allergen inhibited 
IgE binding, and treatment with 
the combination of these two 
antibodies alone was sufficient 
to reduce allergic symptoms in 
34 cat-allergic individuals in a 
clinical trial (JO). Blocking IgE 
to peanut antigens may be simi- 
larly efficacious. 

Although seemingly innocu- 
ous, food allergens may influence 
the gut environment to gener- 
ate conditions that induce CSR 
to IgE. In this regard, allergenic 
foods may have properties that 

induce allergic inflammation, as has been 
proposed (1/, 12). Understanding the influ- 
ences of the gut microbiota, age of expo- 
sure, and environment on the regulation of 
allergic responses to food (13, 14) promises 
to provide clues to elucidating how IgE CSR 
is regulated. m™ 
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MOLECULAR BIOLOGY 


Coding functions of “noncoding” RNAs 


Hundreds of RNA regions that encode microproteins are found to regulate cell growth 


By Lian-Huan Wei and Junjie U. Guo 


igh-throughput RNA _ sequencing 
studies have revealed pervasive 
transcription of the human ge- 
nome, which generates a variety of 
long noncoding RNAs (IncRNAs) 
that have no apparent protein- 
coding functions (J). Subsequent studies 
that globally monitor translation have 
similarly identified numerous transla- 
tion events outside of canoni- 
cal protein-coding sequences 
(2-4), suggesting pervasive 
translation of the transcrip- 


and identified more than 5000 previ- 
ously unannotated ORFs that showed ro- 
bust evidence of active translation. These 
newly identified ORFs include variants 
of canonical ORFs (e.g., canonical ORFs 
with an alternative initiation position), 
upstream ORFs (uORFs) within 5’UTRs, 
and ORFs within transcripts that are an- 
notated as IncRNAs. Consistent with previ- 
ous studies, these ORFs are shorter (fewer 
than 100 amino acids; hence, they are also 


Possible functions of noncoding RNA translation 


Aside from the main open reading frames (ORFs) of messenger RNAs (mRNAs), 


Using CRISPR-Cas9, Chen et al. dis- 
rupted each of 2353 selected unannotated 
ORFs and identified more than 400 ORFs 
that promoted cell growth in both human 
K562 leukemia cells and induced pluripo- 
tent stem cells. In addition to changes in 
cell growth, the disruption of many unan- 
notated ORFs also caused specific changes 
in gene expression. Consistent with trans- 
lation being essential, the authors found 
that start codons were critical for the func- 
tions of a few ORF-containing 
IncRNAs. However, among 
all the IncRNAs that affected 
cell growth when they were 


tome. However, only a few ex- 
amples of functional peptides 
encoded by RNA regions previ- 
ously thought to be noncoding 
have been reported to regulate 


translation also occurs in upstream ORFs (uORFs) within 5’ untranslated 
regions (UTRs) as well as ORFs within cytosolic long noncoding RNAs (IncRNAs). 
These translation events may either have no function, regulate main ORF 
translation, regulate noncoding functions of IncRNAs, produce functional peptides 
that interact with main ORF-encoded proteins, or function independently. 


transcriptionally silenced by 
CRISPR interference, only a 
small subset showed consistent 
phenotypes when their embed- 
ded ORFs were disrupted, sug- 


distinct biological processes 
(5-9). On page 1140 of this is- 
sue, Chen et al. (10) provide 
evidence for an expanded rep- 
ertoire of functional peptides 
encoded by IncRNAs and other 
“untranslated” RNA regions. 
Sequencing of ribosome-pro- 
tected messenger RNA (mRNA) 
fragments (RPFs) allows re- 
searchers to globally identify 


uORF 
i 


ee AAAAA ai 


mRNA 


IncRNA 


Nucleus 


RNA regions that are trans- Ribosome No function (translation noise) 
lated (open reading frames, 
or ORFs). In addition to the 

; : ote 
canonical open reading frames —Yy AAAAA 


(main ORFs) of mRNAs, RPFs 
have also been found in the 
untranslated regions (UTRs) of 
mRNAs and in many IncRNAs 
(2-4). Although some RPFs may 
not represent bona fide transla- 
tion (J), the question remains 
whether translation of ncRNAs 
and UTRs is functional or rep- 
resents an _ inconsequential 
amount of translation that occurs on all cy- 
tosolic 5'-capped RNAs (see the figure). 
Chen et al. used genome-wide loss-of- 
function screens to systematically assess 
the functionality of noncanonical ORFs 
in cell growth. They first filtered RPFs 
obtained from several human cell types 


Department of Neuroscience, Yale University 
School of Medicine, New Haven, CT 06520, USA. 
Email: junjie.guo@yale.edu 


1074 6 MARCH 2020 + VOL 367 ISSUE 6482 


Translation-dependent, product-independent function 
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called microproteins) and less conserved 
than canonical ORFs. They often use near- 
cognate translation start codons such as 
CUG, whereas canonical ORFs are pre- 
dominantly initiated at AUG start codons. 
More than 200 newly identified micropro- 
teins were validated by orthogonal pro- 
teomic and peptidomic analyses in mul- 
tiple human cell lines, suggesting that at 
least some microproteins accumulate in 
cells to detectable concentrations. 
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Cytoplasm 


gesting noncoding functions of 
the remaining IncRNA loci. 

Peptides encoded by In- 
cRNAs and uORFs_ within 
5'UTRs showed specific sub- 
cellular localization and pro- 
tein interactions. Notably, 
some uORF-encoded peptides 
formed complexes with the 
proteins encoded by the cor- 
responding main ORFs. The 
functional importance of such 
interactions remains to be 
tested, as well as the general- 
ity of eukaryotic mRNAs en- 
coding multiple subunits of a 
protein complex. Also unre- 
solved is the extent to which 
the growth phenotypes of some 
uORFs may be partly due to 
their regulation of main ORF 
translation and independent of 
the uORF-translated peptides 
(ie., translation-dependent, 
product-independent), which 
is a well-established function of uORFs 
(12). Future studies on the genetic inter- 
actions between uORFs and main ORFs 
within the same mRNAs should provide an 
answer. Analogous to the translation regu- 
lation function of uORFs, translation may 
also regulate the function and stability of 
some bona fide IncRNAs by remodeling 
RNA structures and/or ribonucleoprotein 
composition, a possibility that remains 
largely unexplored. 
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The study of Chen et al. demonstrates 
the challenge of identifying microproteins 
by evolutionary conservation. Many short 
ORFs that encode conserved peptides, in- 
cluding some IncRNA ORFs identified by 
Chen et al., have indeed been annotated 
as such in some databases. However, most 
microproteins found to be important for 
cell growth do not show strong evolution- 
ary conservation of their amino acid se- 
quences, as canonical proteins typically do. 
Perhaps not all the ORFs that regulate cell 
growth in vitro would have similar impact 
on organismal fitness (13). Nevertheless, 
extrapolating from this study, many more 
interesting peptides encoded by noncanon- 
ical ORFs must exist and regulate far more 
biological processes than currently known. 
Their frequent usage of non-AUG start co- 
dons may allow for translational control 
that is distinct from canonical ORFs. 

Noncanonical translation has been 
linked to a variety of neurological diseases 
caused by the expansion of short tandem 
repeats in the genome (J4). For example, 
expanded GGC repeats within the 5’'UTR 
of human fragile X mental retardation 1 
(FMRI) mRNA is translated into polygly- 
cine by a uORF-like mechanism (/5). It is 
unclear whether most repeat-associated 
non-AUG translation occurs this way. 
However, the pervasive translation of cy- 
tosolic RNAs and the widespread usage of 
alternative initiation codons in producing 
physiologically functional peptides pro- 
vide a parsimonious explanation for the 
pathological translation events in noncod- 
ing regions. & 
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LIQUID CRYSTALS 


Rings rule three-dimensional 


active matter 


Dynamical defect networks are imaged in viral-particle 
liquid crystals driven by biomotors 


By Denis Bartolo 


e are active matter, in that we move 

and deform on our own without 
resorting to external forces. Dur- 

ing the past 10 years, fluids and 

soft solids have been turned into 

active matter animated by spon- 

taneous flows and deformations, usually 
by seeding a fluid with self-propelled units 
that convert chemical or electric energy into 
directed motion. Animated by spontaneous 
flows and deformations driven from within, 
active fluids are usually powered by dispers- 
ing active particles that 
self-assemble into coherent 
dynamical structures. Ac- 
tive materials—including 
polymer gels, liquid emul- 
sions, or colloidal suspen- 
sions with mesmerizing 
emergent dynamics (J-3)— 
have been mostly limited 
to two-dimensional (2D) 
materials. However, thin 
films are unlikely to host 
more than a small fraction 
of the full complexity of ac- 
tive matter. On page 1120 of 
this issue, Duclos et al. (4) 
report the synthesis and 
the observation of 3D ac- 
tive liquid crystals (see the 
image) and elucidate the 
elementary excitations re- 
sponsible for their complex inner dynamics. 
Nematic liquid crystals are formed of 
elongated molecules or colloidal particles 
that fluctuate in their orientation, but on av- 
erage, all locally point along the same direc- 
tion, described as the director. In ordinary 
passive nematics, shear deformations are 
unhindered as in normal fluids, but bend- 
ing deformations are suppressed by elastic 
stresses in favor of homogeneously aligned 
states (see the figure, top). In stark contrast, 
when a nematic includes active components 
that power internal stresses that distort the 
fluid, the smallest bending deformations can 
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The active 3D nematic was imaged 
with wide-field fluorescent microscopy. 


be exponentially amplified (see the figure, 
top). This generic instability yields unsteady 
bulk flows casually termed “active turbu- 
lence” (5). Unlike turbulence in normal flu- 
ids, the chaotic flows of active nematics are 
devoid of self-similar structures (ones that 
repeat identically at all scales). The flows 
originate from the formation and disappear- 
ance of vortices that have a well-defined size 
set by the competition between active and 
elastic stresses. 

The spatial structure of 2D chaotic vor- 
tices is easily inferred from a static picture 
of the nematic director that defines its local 
orientation. In 2D active 
nematic films, each vortex 
is centered on pointwise 
singularities of the director 
sketched in the figure, top. 
Nematic defects are a sig- 
nal of a finite local wind- 
ing of the director field 
and interact at a distance, 
much like electric charges. 
Topological charge being a 
conserved quantity, no net 
charge can develop in a 
defect-free nematic, and in 
two dimensions, no smooth 
deformation can morph a 
+¥% into a -% charge. This 
topological constraint im- 
plies that active stresses 
can only nucleate pairs of 
oppositely charged defects 
in response to bending instabilities, and in 
two dimensions (see the figure, top). 

The mapping of the unsteady and hetero- 
geneous flows of active nematics on the mo- 
tion of interacting charged particles was the 
key to understanding 2D active turbulence. 
However, in 3D liquid crystals, the nature 
of the fundamental excitations is drastically 
different (6). The director field in 3D nemat- 
ics can host spatially extended topological 
excitations in the form of disclination lines 
and loops. Although these singularities were 
vastly explored in passive systems, they re- 
mained out of reach of active matter until 
the study of Duclos et al., who assembled 
liquid crystals from micrometer-long fila- 
mentous bacteriophage viruses instead of 
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the more typical small molecules 
that from nematic phases. 

These slender biological col- 
loids provide a prototypical nem- 
atic phase at room temperature 
(7). The authors built on earlier 
works from Dogic and co-work- 
ers and dispersed a small frac- 
tion of cross-linked microtubules 
and molecular motors extracted 
from eukaryotic cells into this 
liquid crystal to create an ac- 
tive material (8). When fueled 
with adenosine triphosphate, 
the molecular motors elongated 
the microtubule bundles, which 
in turn generated active stresses 
that drove bending instabilities 
and bulk flows throughout the 


Topological defects in active liquid crystals 

In active matter, internal molecular motors can drive fluid or particle motion and 
amplify responses to external stimuli such as bending forces. Most examples have 
been thin films, but Duclos et a/. now report three-dimensional (3D) active matter, 
a viral particle liquid crystal (LC) driven by embedded microtubule networks. 


oe eo. ea s% eo fF 
o Soe 2]o 2& 
SeseoS=> eee _SoF 
oe om Sores 
oo es Sw aS 


Sy a =o, 
=s< =. bas: eed a 


Passive recovery Bending 


Deformations in thin films 

Bending a nematic LC (middle) distorts the molecular alig 
nematic (left), the orientation along the director simply re 
(right), distortions are exponentially amplified and create 
topological defects. 


Active recovery 


nment. In a passive 
axes back. In an active LC 
pairs of +4 and -% 


observations and _ established 
that the two elementary ring 
singularities originate from 
activity-induced flows, so that 
active hydrodynamic theories 
delivered compelling accuracy. 
However, 3D active turbulence 
cannot be reduced to interact- 
ing-particle models. Quantita- 
tive connections between defect 
topology, loop geometry, and 
flows are only emerging and re- 
main to be elucidated. 

In a field dominated by nu- 
merical simulations and theory, 
Duclos et al. offer a formidable 
experimental platform with 
which to gain a deep insight in 
active matter. One can envision 


entire sample. 

The authors then solved the 
second formidable challenge 
of observing the inner chaotic 
structure of this chimera of 
nematic molecules and motors. 
Elucidating the spatiotemporal 
structure of a 3D active nematic 
must overcome two conflicting 
constraints. One is imaging the 
nematic orientation from mi- 
crometer to millimeter scales, 
which is a time-consuming task, 
and the other is computing the 
spatial-orientation maps with 
high temporal resolution. The 
optical transparency of the ac- 
tive material made it possible 
to resolve this technical conflict 
by taking advantage of multiv- 
iew light-sheet microscopy (9). 
This high-speed, high-resolution 
evolution of selective-plane illumination mi- 
croscopy provided the quantitative measure- 
ments of the local director field over the en- 
tire volume of a 3D active material. The po- 
tential of this state-of-the-art experiment can 
be immediately grasped from the authors’ 
supplementary movies, which also illustrate 
the much faster flows in active nematics. 

As the molecular motors drove activity, a 
living network of disclinations emerged and 
constantly rewired its connectivity (see the 
figure, bottom left, which reveals the regions 
of space where the active-nematic director is 
singular or discontinuous). These living net- 
works of singularities do not reflect built-in 
heterogeneities of the nematic structure. Du- 
clos et al. instead found that the fundamental 
excitations of 3D active nematics are discli- 
nation loops that can nucleate, shrink, open, 
and merge to form spatially extended struc- 
tures out of smooth and homogeneous re- 
gions of space (see the figure, bottom right). 


They located 
(the linear st 


Deformations in 3D LCs 
Duclos et al. used high-resolution 
microscopy to image the director field. 


internally driven bending instabilties. 
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the extended defects 
ructures) formed by 
not in thin films. 


around the singular loops revealed that two 
ring-shape excitations chiefly rule the emer- 
gent distortions in the liquid crystal. 

Both families of disclination loops are 
topologically neutral but are at the core of 
geometrically distinct textures. Wedge-twist 
loops connect sections of spaces hosting the 
familiar +4% and -% defects found in 2D 
nematics (see the figure, bottom right). This 
continuous connection is made possible by 
a smooth 3D rotation of the nematic units 
along the loop. At mid-distance between the 
two planar wedges, the variations of the di- 
rector around the centerline of the disclina- 
tion line forms the equivalent of a Mobius 
strip, a one-sided shape that can be visual- 
ized by taping together the ends of a paper 
band after applying a half-turn twist (70). 

The second and most abundant ring- 
shape singularities are pure-twist loops 
along which Mobius conformations encircle 
the entire defect line. Comparisons with nu- 


A careful analysis of the nematic orientation 
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merical resolutions further confirmed these 
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Defect connections 
The wedge-twist loop connects 
+% and -¥2 defects like a Mobius 
strip. This smooth evolution can 
only occur in 3D materials and 


the synthesis of active smectics 
(that have a layered structure) 
or chiral nematics that have no 
2D counterpart. Another ap- 
pealing prospect would be ma- 
nipulation of disclination net- 
works around solid inclusions. 
Particle-laden liquid crystals are 
not bound to hosting only neu- 
tral topological excitations. For 
example, isolated point charges 
exist in three dimensions, such 
as hedgehogs, in which the di- 
rector orientations resemble 
hedgehog spikes. Activity could 
be put to work to dynamically 
twist, braid, and knot disclina- 
tion lines or to couple them to 
point defects not yet observed 
in the experiments of Duclos et 
al. (6, 11). The number of oppor- 
tunities delivered by the combi- 
nation of a versatile active material and a 
state-of-the art observation tool seems virtu- 
ally unlimited. Last, the production of actual 
volumes of synthetic active nematics is a 
major milestone on the route toward practi- 
cal application of smart active materials. & 
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RETROSPECTIVE 


Frank Press (1924-2020) 


Policy adviser and National Academy of Sciences president 


By Marcia McNutt 


rank Press, geophysicist and two-term 

president of the National Academy of 

Sciences (NAS), died on 29 January. He 

was 95. This soft-spoken son of Jewish 

immigrants advised four U.S. presi- 

dents, built institutions to prominence, 
and provided the scientific guarantee for the 
nuclear test ban treaty. No scientist had more 
impact on the American science-policy inter- 
face in the late 20th century. 

Born in Brooklyn, New York, Press earned 
his bachelor’s degree in physics from City 
College of New York before completing a 
Ph.D. in geophysics at Columbia University 
with the famed geophysicist and oceanogra- 
pher Maurice “Doc” Ewing in 1949. Press was 
appointed to the faculty of Columbia, where 
he developed the Press-Ewing seismom- 
eter and cofounded the Lamont Geological 
Observatory (now the Lamont-Doherty Earth 
Observatory). In 1955, he was recruited to the 
Seismological Laboratory at the California 
Institute of Technology (Caltech), where just 
2 years later he succeeded its founder, Beno 
Gutenberg, to become the director, leapfrog- 
ging more senior seismologists. As a member 
of the international “Conference of Experts” 
in Geneva in 1958, Press convinced world 
leaders that a worldwide seismic network 
could be designed to monitor compliance 
with a nuclear test ban treaty. This assurance 
that such a treaty could be verified was in- 
strumental to its ultimate success. At Caltech, 
Press championed the use of computers to 
digitize the increasing volume of seismic 
data. Along with colleagues, he used this pro- 
cessing power to make the first observational 
detection of the excitation of Earth’s normal 
modes from the 1960 Chilean earthquake. 

In 1965, the Massachusetts Institute of 
Technology (MIT) recruited Press to head 
the Department of Geology and Geophysics. 
Press transformed the classical department 
into a modern-day Department of Earth 
and Planetary Sciences (now known as the 
Department of Earth, Atmospheric, and 
Planetary Sciences), addressing the cutting- 
edge issues of the day. For Neil Armstrong’s 
first walk on the Moon, Press designed a 
seismic experiment to passively detect moon- 


President of the National Academy of Sciences, 
Washington, DC, and chair of the National Research Council 
of the National Academies of Sciences, Engineering, and 
Medicine, Washington, DC. Email: mmcnutt@nas.edu 
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quakes and asteroid impacts. The experi- 
ment returned the first information on the 
internal structure of the Moon and proved 
this concept for probing planetary interiors. 
The Joint Program between MIT and the 
Woods Hole Oceanographic Institution that 
educates graduate students in ocean sciences 
and engineering is one of Press’s more suc- 
cessful innovations during his MIT years. 
President Jimmy Carter appointed Press 
as his science adviser and director of the 
Office of Science and Technology Policy in 
1977. By that time, Press was no stranger 
to Washington, D.C., policy circles. He had 


already served 3 years on the President’s 
Science Advisory Committee under John 
Kennedy and Lyndon Johnson and 6 years 
on the National Science Board under Richard 
Nixon. While in the White House, he champi- 
oned federal investment in research and in- 
ternational cooperation. He created the first 
opportunities for Chinese students to study 
at U.S. universities, a cohort that grew to 
100,000 young scholars in just 5 years. 

In 1981, Press was elected president of the 
NAS. Despite what the research community 
viewed as the Reagan administration’s as- 
saults on science budgets, science-based pol- 
icy, and the environment, Press managed to 
avoid politicizing the NAS and deftly piloted 
its advisory mission to new prominence by 
supplying unassailable advice based on sci- 
ence. For example, the Supreme Court ruled 
in the late 1980s that requiring creationism 
to be taught in public schools along with 
evolution was unconstitutional under the 
First Amendment, influenced by a NAS pub- 
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lication (Science and Creationism, 1984) that 
pointed out that creationism is not supported 
by scientific evidence. In an environmental 
win, a report on acid rain (Acid Deposition: 
Atmospheric Processes in Eastern North 
America, 1983) led the Reagan administra- 
tion to cap the emissions of nitrogen dioxide. 
A year later, Congress amended the Clean Air 
Act to address sulfur dioxide emissions. Press 
was also a defender of the social sciences, 
stating that their research is evidence based 
and that they belong in the NAS. 

Two high-profile consensus _ reports 
changed the course of the nation during 
Press’s NAS tenure. Confronting AIDS (1986) 
spurred a reluctant administration to ac- 
tion to prevent HIV infections from becom- 
ing a nationwide public health epidemic. 
Within a few years, several African nations 
that did not act saw life expectancy fall. In 
1988, Mapping and Sequencing the Human 
Genome launched a project that spawned 
multibillion-dollar industries and has con- 
tributed immeasurably to human health. In 
2000, recognizing that Press had positioned 
the NAS as the most trusted adviser in sci- 
ence for policy, the NAS honored him as its 
first-ever “president emeritus.” 

After his retirement from NAS in 1993, 
Press founded the Washington Advisory 
Group (WAG). WAG worked with, among 
others, the King Abdullah University of 
Science and Technology. Press and _ col- 
leagues helped Saudi Arabia develop a first- 
class research university. 

When Frank Press talked, people listened. 
I was one of those people. I was the final 
hire into “the department that Frank built” 
at MIT before he left to head the NAS. Later, 
he helped recruit me to lead the Monterey 
Bay Aquarium Research Institute for an 
idyllic dozen years, where he served on the 
board. His office was my first stop for advice 
after being asked to lead the U.S. Geological 
Survey in 2009. I was wavering on accept- 
ing the Washington, D.C., position because 
it would require a long separation from my 
family in California. When Press learned that 
I would also be science adviser to the secre- 
tary of the interior, he was adamant that I ac- 
cept. Press saw an important opportunity to 
ensure that science would have a direct route 
into Cabinet-level decisions. He made me re- 
alize that choices were no longer about me 
but rather what was best for the nation. 

I also followed Press’s advice to benefit 
from the fresh perspectives of new colleagues 
and a new mission by changing positions 
frequently. I could not be more fortunate to 
have had such an amazing mentor choose 
me. His passing is a loss not just for the na- 
tion but for the many he inspired. 
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RESEARCH ETHICS AND LAW 


Overcoming obstacles to 
experiments in legal practice 


The biggest ethical concern is failing to 
seek reliable evidence to guide the legal profession 


By H. Fernandez Lynch,? D. J. Greiner, 
I. G. Cohen? 


he importance of evidence-based pol- 

icy rooted in experimental methods 

is increasingly recognized, from the 

Oregon Medicaid experiment to the 

efforts to address global poverty that 

were awarded a 2019 Nobel Prize. Over 
the past several decades, there have been at- 
tempts to extend this scientific approach to 
legal systems and practice. Yet, despite prog- 
ress in empirical legal studies and 
experimental social policy research, 
judges, lawyers, and legal services 
providers often fail to subject their 
own practices to empirical study or 
to be guided by empirical data, with 
a particular aversion to randomized 
controlled trials (RCTs) (1). This is 
troubling, as many questions fun- 
damental to legal practice and those 
it affects, such as allocation of attor- 
ney services, bail decisions, and use 
of mandatory mediation, could and 
should be informed by a rigorous 
evidentiary foundation. Although 
there are practical obstacles to un- 
dertaking legal practice RCTs, they 
have also been stymied by cultural 
barriers within the legal profes- 
sion. Whereas medical practitioners 
are expected to subject their practices to 
randomized study and rely on the data pro- 
duced, the default in legal practice is to rely 
on experience, common wisdom, and profes- 
sional judgment, often in settings in which 
clients face constraints on their freedom. To 
address ethical concerns stemming from this 
cultural difference, we draw on lessons from 
biomedical and policy research, as well as ex- 
periences of the Access to Justice Lab (A2JL), 
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a leading institution promoting the use of 
RCTs to inform legal practice. 

RCTs in law face funding challenges, a 
dearth of lawyers trained in experimental 
methods, and blanket roll-out of new laws 
and legal interventions across entire popula- 
tions in a manner that often precludes ran- 
domization. But the most fundamental chal- 
lenges are ethical. When field RCTs of legal 
practice interventions have been proposed, 
practitioners have sometimes raised con- 
cerns about violating both perceived profes- 


sional obligations and the individual rights 
of research subjects. Institutional review 
boards (IRBs), responsible for assessing the 
permissibility of research, also sometimes 
exhibit difficulty applying familiar research 
ethics concepts to nonmedical settings, a 
phenomenon exhibited toward social science 
research more broadly. Ethical objections 
have posed real barriers to A2JL’s work (see 
the table), which have occasionally been re- 
solved through education and negotiation, 
but have more often led to abandonment of 
important, ethical studies. 

There have been growing calls to system- 
atize and expand efforts to subject legislative 
and regulatory interventions to randomized 
experimentation (2), as well as increasing at- 
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tention to the ethics of policy experiments 
more broadly (3-6). We build on this work, 
but because our goal is to highlight objec- 
tions rooted in concerns over randomization 
in lieu of the individual judgments of legal 
professionals, the principles of biomedical 
research ethics are the best starting point 
for potential solutions. The medical profes- 
sion has been able to manage many ethical 
concerns raised by RCTs and to create an 
evidence-based culture, a portent of what the 
legal profession could achieve. The ethics of 
clinical research have also developed in a way 
that addresses the traditional fiduciary obli- 
gations of physicians, whereas perspectives 
regarding the obligations of legal practitio- 
ners seem to be generating erroneous resis- 
tance to legal practice RCTs. 

Although there are differences between 
medical and legal settings, including the 
role of democratic authority, protected legal 
rights, and the unavoidably coercive nature 
of the criminal justice system, many ethical 
challenges facing field RCTs of legal practice 
interventions, including those related to ran- 
domization and consent, are familiar in bio- 
medical research. Therefore, we seek to apply 
lessons from that context to law— 
and to identify novel ethical consid- 
erations where further attention is 
warranted. Research ethics regula- 
tions will not always apply to legal 
practice RCTs, depending on insti- 
tutions and funding. Nonetheless, 
public trust, researcher obligations, 
and pragmatic considerations all de- 
mand that ethical principles be both 
clarified and upheld. 


EVIDENCE-BASED LEGAL PRACTICE 
The core idea of evidence-based 
medicine is that because few medi- 
cal interventions are so obviously 
safe and effective that they can be 
adopted without testing, patient 
care should be guided by scientific 
evidence. Although observational 
studies and historical controls can be superior 
to anecdote, randomization and contempora- 
neous controls help minimize confounders 
and support causal conclusions. As a result, 
RCTs are rightly regarded by the medical pro- 
fession as essential to quality care, albeit with 
their own methodological constraints. 

These same concepts apply to legal prac- 
tice but have not been widely endorsed by 
the legal profession, even when stakes are 
high, with implications for freedom (e.g., 
incarceration), safety (e.g., protection from 
domestic violence), and access to resources 
essential to fundamental human needs (e.g., 
legal assistance in securing government ben- 
efits). Although legal practitioners may rely 
on certain guidelines when available, these 
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Examples of legal practice research questions, RCT designs, objections, and replies 


QUESTION 


STUDY DESIGN 


OBJECTIONS 


RESPONSE 


Does availability of risk 
assessment scores improve 
judicial bail decisions? 


Randomize arrestees as to whether 
bail judge receives their risk 
assessment score 


Too much at stake 
(freedom v. incarceration) 


Voluntary consent from arrestees 


not practicable 


High stakes demand empirical study 

Equipoise as to utility of risk assessment scores 
Either practice (scores or no scores) is permissible 
without consent outside research; no entitlement to 
either policy 


Can legal professionals 
effectively allocate scarce 
resources for greatest benefit? 


Randomize criminal defendants to 
have public defender decide whether 
to assign a social worker or allocate 
services by some other process 


Unfair to defendants who would 
have been assigned a social worker 
but for the study 


Scarcity of social workers 


Equipoise as to quality of attorney allocation decisions 
Either practice (public defender triageor 
other approach) permissible outside research; 

no entitlement to either a particular triage process 

or social work assistance 


Does a combative litigation 
style produce superior client 
results compared to a 
facilitative, deal-making style? 


Randomize whether legal aid 
attorneys in summary eviction 
cases litigate aggressively (e.g., 
demand jury trials, file motions, 


Contrary to responsibility of 
attorneys to use their best 
professional judgment about ideal 
strategy for each individual case 


make ambitious settlement 
demands) or nonaggressively 


are also often not evidence-based, being in- 
stead informed by politics or the subjective 
collective wisdom of professional authorities. 
Even when evidence is available from one of 
the few dozen legal practice RCTs conducted 
in the United States to date, and even when 
that evidence fails to support the use of inter- 
ventions that are costlier than their alterna- 
tives, it has not been used to change practice 
(1). When aspects of legal practice are rooted 
in philosophical and normative claims about 
rights and interests, empirical study may 
have little to add. But when interventions aim 
to achieve certain instrumental outcomes, as 
is often the case, evidence is critical (J, 2). 


RANDOMIZATION 

One commonly articulated objection to le- 
gal practice RCTs is that it is unethical to 
randomize legal interventions in a way that 
temporarily precludes individualized profes- 
sional judgment. For example, in objecting 
to a proposed A2JL study in which tenants 
facing eviction litigation would be random- 
ized to participate in a mandatory pretrial 
settlement conference or proceed without 
one, local legal services providers argued on 
the basis of their anecdotal experience that 
settlement conferences assist in keeping low- 
income tenants in their homes, despite rea- 
son to question that assumption. Similarly, 
organizations running charitable bail funds 
have refused to discuss RCTs, even when they 
lack sufficient resources to assist all eligible 
arrestees, arguing that because they “already 
know” the difference it makes to have bail 
posted, it is wrong to randomize “lifesaving” 
legal “treatments.” 

These examples are indicative, in part, of 
the legal profession’s lagging understanding 
of foundational principles of research design 
and ethics. Similar objections have long been 
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considered resolved in the realm of clini- 
cal research, where one might also be con- 
cerned that randomized participants could 
be harmed by the denial of some perceived 
benefit in a way that violates the physician’s 
professional obligation to provide individual- 
ized care (7). However, an RCT can be ethi- 
cal—and perhaps ethically mandatory—when 
there is sufficient professional uncertainty, or 
“equipoise,” regarding the safety or efficacy of 
the experimental intervention compared to 
the standard of care. In these cases, random- 
ization violates no professional obligation be- 
cause it will not result in practitioners know- 
ingly exposing participants to harm. 

In addition to the culture shift needed 
in law to adopt this view, it is also essential 
to challenge faulty moral intuitions exhib- 
ited in the legal profession and elsewhere 
that it is permissible to roll out an untested 
intervention across an entire population, 
but not to randomize population groups to 
rigorously examine the intervention’s ef- 
fect (8). Moreover, because resolving extant 
uncertainty is essential to the professional’s 
fiduciary obligation to provide the best care 
or services, it would be logically incoherent 
to refuse to subject unproven approaches to 
study unless or until evidence is produced to 
challenge their effectiveness. 

Even when there is relative certainty as 
to which intervention to prefer in ideal cir- 
cumstances, practical constraints may ren- 
der that intervention neither attainable nor 
sustainable for some or all of the relevant 
population (5, 9). In such cases, researchers 
should work to resolve “policy equipoise” (5), 
developing evidence needed to inform open 
questions, such as how to optimally allocate 
the intervention and whether a less resource- 
intensive approach might suffice. Doing this 
work through RCTs is methodologically ideal 
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Equipoise as to ideal litigation strategy 


both strategies within the “standard of care” 


and in many cases ethically permissible. 
However, one further consideration particu- 
larly relevant to legal practice interventions is 
that the ethical permissibility of an RCT that 
fails to provide all subjects with the preferred 
intervention depends on the absence of legal 
or ethical entitlement to that intervention. 
Although patients may have ethical entitle- 
ments, they have few legal entitlements to 
medical services, at least in the United States. 
Those affected by legal practice interventions 
may have stronger claims. 

Consider an offer of representation by a 
lawyer. This is a federal constitutional en- 
titlement in the United States only for seri- 
ous criminal charges and a few other narrow 
matters; in these contexts, it would be legally 
impermissible to withhold the offer from 
some individuals, even for research purposes. 
However, there are other circumstances in 
which an individual may be expected to 
benefit from an offer of legal representation 
without being legally entitled to it. In the face 
of scarcity, a legal clinic could reasonably 
randomize similarly situated individuals to 
receive either an offer of traditional repre- 
sentation, an offer of self-help resources only, 
or no offer at all to research how to maximize 
benefits across the population. Individuals 
randomized without entitlement have either 
not been harmed, because they might not 
have received the benefit outside the RCT, or 
have not been unreasonably harmed, because 
even if they would have received the benefit, 
they lack a greater claim to it than others fac- 
ing similar need. 


VOLUNTARY CONSENT 

Other objections to legal practice RCTs are 
based on concerns that RCTs may be inher- 
ently coercive. Perceived or actual coercive- 
ness might stem from the involvement of 
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lawyers, who like physicians are often in a 
position of power compared with prospec- 
tive subjects; the provision of desired legal 
services only to study participants (or select 
participants); or the constraints on freedom 
present in criminal settings. 

Concerns about professional power dy- 
namics can be addressed in part by em- 
phasizing, where true, that participation 
in research is voluntary and refusal will 
involve “no penalty or loss of benefits to 
which the subject is otherwise entitled” (0). 
Nonetheless, potential subjects may feel 
pressured when research is the only way to 
access a desired intervention, a concern that 
has been a persistent point of confusion in 
research ethics. This pressure is not coercive 
absent entitlement to the desired interven- 
tion (17), and it is not unduly influential if 
the risks of participation are reasonable and 
not so great as to outweigh the benefits of 
the desired intervention (72). In fact, where 
research risks are reasonable and there is 
the prospect of direct benefit to subjects, one 
might worry less about constraints on volun- 
tary consent because research participation 
will be objectively reasonable. 

This is important because one distinctive 
feature of legal practice RCTs is the inher- 
ent involuntariness of some aspects of the 
legal system, particularly in the context of 
criminal justice. Unlike medicine, where 
there is an expectation of informed consent 
to justify interventions on a patient’s body, 
individuals who have been arrested, are 
facing prosecution, or have been convicted 
have little control over what happens to 
them. At least partly in recognition of these 
circumstances, U.S. research regulations 
demand special protections for individuals 
involuntarily confined or detained in penal 
institutions (10), which can also guide re- 
search in other legal settings. 

To the extent voluntariness concerns per- 
sist, further analysis is warranted to evalu- 
ate whether securing traditional research 
consent is a reasonable expectation for 
all legal practice research interventions. 
Current norms permit consent waiver only 
for minimal risk research that would be im- 
practical to conduct if consent were required 
(10). This would likely cover RCTs assess- 
ing mandatory mediation or online dispute 
resolution, for example, but “minimal risk” 
is challenging to define and the subject of 
intense debate in research ethics, especially 
when evaluating practices that may be part 
of the existing standard of care, despite re- 
maining unproven (73). And at least some 
legal practice interventions will pose greater 
than minimal risk, such as a heightened pos- 
sibility of incarceration (e.g., for arrest, bail, 
and sentencing interventions) or inability to 
access government benefits (e.g., for compar- 
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isons of self-help models versus assistance of 
counsel). The response to concerns about 
voluntary consent must not be to inhibit le- 
gal practice RCTs, however, because without 
them, potentially harmful legal interven- 
tions may be adopted and potentially benefi- 
cial interventions may be dismissed without 
either evidence or consent (8). 

Similar questions arise in circumstances 
in which it may be infeasible to secure in- 
dividual consent, a challenge faced across 
medical, policy, and legal practice interven- 
tion research. For example, it may not be 
possible to study an intervention without 
mandatory application to some group, such 
as use of a sentencing algorithm to substi- 
tute for judicial discretion. Similarly, use of 
randomized cluster design can sometimes 
preclude individual consent, such as when 
different legal services clinics are each ran- 
domized to offer only a single approach, 
removing choice from each clinic’s clients 
unless they seek services elsewhere. It is 
important to recognize that these interven- 
tions could be implemented without con- 
sent outside a research context (and some- 
times already have been), which may render 
the inability to secure research consent less 
ethically concerning (2, 8). Nevertheless, 
alternative protections may be appropriate. 
For example, the data collection required 
for research purposes should not unduly 
infringe participants’ autonomy and pri- 
vacy rights (6), and research should avoid 
disproportionately testing interventions 
designed to justify reducing resources allo- 
cated to vulnerable populations (14). 


BYSTANDERS 

A final concern that may arise for legal prac- 
tice RCTs, as well as other types of studies, 
centers on their potential impact on both 
incidental bystanders and intended third- 
party beneficiaries, an understudied area in 
research ethics generally (75). For example, 
research interventions regarding bail deci- 
sions might be intended to achieve public 
safety goals, to advance the interests of ar- 
restees, or both. Where third parties are 
affected by research but not directly inter- 
vened upon, there are further questions of 
consent. They may not be individually iden- 
tifiable in advance (e.g., the future victim of 
an arrestee released on bail due to a study 
intervention), making their consent impos- 
sible. Even if bystanders are identifiable, 
however, it is unclear that they should be 
given the authority to withhold consent and 
preclude research that offers the prospect 
of direct benefit to research subjects and 
future individuals like them (e.g., arrestees 
that may benefit from a bail decision). Yet, 
to the extent bystanders face research risks, 
those risks seem worthy of consideration 
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when assessing a study’s ethical permissi- 
bility. Conversely, although we traditionally 
consider risks to research subjects, where 
research interventions are intended to pro- 
duce social rather than individual benefit, 
and the intervention does not entail any 
breach of a subject’s rights or entitlements, 
it is not clear that subjects should be al- 
lowed to withhold their consent. Although 
IRBs are directed to consider benefits that 
extend beyond study participants when as- 
sessing a study’s risk-benefit profile, they 
have not typically considered risks to by- 
standers. This area deserves further atten- 
tion in both law and medicine. 


CONCLUSION 

Although objections to legal practice RCTs 
often have been framed in ethical terms re- 
lated to randomization and consent, the big- 
gest ethical challenge lies in failing to seek 
reliable answers to guide the legal profes- 
sion. The profession—and IRBs—should 
embrace RCTs, while being thoughtful about 
their methodological limits and devoting 
further attention to unresolved issues, in- 
cluding which legal interventions should 
be viewed as entitlements; when research 
consent should be required; how to address 
bystander interests; which RCTs should be 
prioritized; when it may be appropriate to 
proceed with a promising intervention in the 
absence of confirmatory evidence; and how 
to best engage with communities that would 
be affected by this research. 
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Evolution makes a splash 


An underwater arms race drives a vibrant new game 


By Angela Chuang? and Orlando Schwery? 


orth Star Games’ new board game 

Oceans simulates marine evolution, 

modeling the fierce struggle for exis- 

tence among competing species in an 

“eat or be eaten” environment. It does 

this by enabling players to create spe- 
cies whose feeding strategies are dictated 
by one of 12 trait cards. “Filter feeders,” 
for example, forage exclusively from the 
fish tokens in the game’s reef zone. “Apex 
predators,” on the other hand, must suc- 
cessfully attack other players’ species. Play- 
ers can react with defensive behaviors, such 
as “inking” and “schooling,” which, in turn, 
create pressure on other players to pro- 
duce predators with ever-stronger attacks, 
in a coevolutionary arms race. The food 
tokens a player earns from feeding attempts 
are added to the individual board of each 
species and are eventually converted into 
the final points that determine victory at 
the game’s conclusion. 
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Oceans is the latest addition to the highly 
praised North Star Games Evolution series 
(1-3). Veterans of the earlier games in that 
series will recognize Catherine Hamilton’s 
mesmerizing illustrations on the box and 
on the game’s main (“surface”) deck of 
cards. Her vibrant watercolors 
simultaneously capture the el- 
egance of real species’ forms 
while leaving room for players’ 
imaginations to complete the 
canvas of their new creations. 
The “deep” deck, however, 
which comprises 89 unique 
abilities, includes designs from 


Increasingly powerful beasts emerge 
from Oceans’ deeper waters. 


between and among players’ species. For 
instance, “parasitic” creatures leech food to- 
kens from adjacent species, whereas “shark 
cleaners” gain tokens when a large attack 
has been made. A single feeding event can 
trigger a cascade of effects among a spe- 
cies’s symbionts, highlighting the intercon- 
nectedness of life in such ecosystems and 
illustrating the vital role of keystone spe- 
cies. Abrupt changes to these species send 
reverberations across the community and 
spell extinction for those that cannot evolve 
fast enough. Players are also discouraged 
from allowing their species to indulge in 
unbridled feeding frenzies: Those that gain 
too many fish tokens overpopulate, and 
their numbers collapse by half. 

Every game of Oceans features an event 
known as the Cambrian Explosion, named 
for the period in Earth’s history characterized 
by rapid speciation rates and a burst of mor- 
phological innovation. Here, players use dou- 
ble the amount of trait cards per turn, thus 
hastening the evolutionary turnover of traits. 
This second half of the game also requires 
species to consume more food per round to 
survive, intensifying the game’s tempo. 

While it is possible to play the game using 
only the 12 surface trait cards, the game gets 
really weird (in a good way) if players choose 
to add in the cards from the deep deck after 
the Cambrian Explosion. This deck features 
powerful and mysterious cards that lean into 
science fiction and embrace the unknown 
lurking in our thalassic depths, as deep- 
sea krakens, leviathans, and behemoths 
rear their heads. It also features a range of 
real, magnificent, and game-changing traits 
found in the deep sea, such as 
intelligence, bioluminescence, 
and coprophagia (yes, the game 
goes there). It is worth not- 
ing that gentle giants can also 
evolve with great success, lest 
it seem as though the game is 
biased toward aggressive strate- 
gies at this phase. 


other talented artists, lend- Oceans Oceans invites players to mar- 
ing to their distinctive feel. Nick Bentley, vel at the rich diversity of ocean 

The game preserves one of the Dominic Crapuchettes, life, from the vivacious reef to 
key elements of its predecessors, scr the mysterious ocean depths. 


illustrating trade-offs associated 
with certain feeding strategies. 
Food tokens are limited, and either play- 
ers must keep the reef well stocked, or spe- 
cies that feed from the reef must eventually 
adapt to life in deeper habitats. But Oceans 
really ups the ante when it comes to species 
and player interactions. 

The game cleverly incorporates a host of 
traits that create symbiotic relationships 
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Building on the prior successes 
of the series, it packages all this 
into a fast-paced and dynamic strategy game 
with high replay value. & 
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The landmark case that almost never was 


A law professor investigates the legal decision to regulate U.S. greenhouse gases 


By Michael B. Gerrard 


he U.S. Supreme Court’s 2007 deci- 

sion in Massachusetts v. Environ- 

mental Protection Agency is widely 

seen as the most important U.S. envi- 

ronmental ruling of all time. But the 

suit, which led to a ruling that the 
Clean Air Act of 1970 empowered the U.S. 
Environmental Protection Agency (EPA) 
to regulate greenhouse gases, was almost 
never brought. Richard J. Lazarus’s won- 
derful new book, The Rule of Five, is the 
inside story of how this case came to be, 
how its lawyers struggled and 
fought over theories and roles, 
and how the late Justice John 
Paul Stevens patched together 
the five votes needed to secure 
a majority. 

Lazarus is ideally suited to 
tell this story. A law profes- 
sor at Harvard, he has repre- 
sented the government and 
environmental groups in 40 
Supreme Court cases and pre- 
sented oral argument in 14. He 
also roomed with John Roberts 
during law school, and they 
later taught a course on the 
history of the Supreme Court 
together. For this project, he 
interviewed almost everyone 
involved in the case, including 
several of the judges and jus- 
tices and (with their permis- 
sion) their law clerks, and had 
access to internal memos and 
drafts of opinions. 

The case in question was the 
brainchild of Joe Mendelson, 
a lawyer working for a little- 
known environmental group in Washing- 
ton, D.C., called the International Center 
for Technology Assessment. In 1998, he 
drafted a 35-page petition to the EPA, ar- 
guing that it was the agency’s responsibil- 
ity to regulate carbon dioxide and other 
greenhouse gases as pollutants, and put it 
in a drawer, awaiting the right time to file. 
A year later he decided that the time had 


SS 


The reviewer is at the Sabin Center for Climate Change Law, 
Columbia Law School, New York, NY 10027, USA, and is 

the co-editor of Legal Pathways to Deep Decarbonization in 
the United States (Environmental Law Institute, 2019). 
Email: mgerra@law.columbia.edu 


1082 6 MARCH 2020 « VOL 367 ISSUE 6482 


come, but leaders of other environmental 
organizations pleaded and then pressured 
him not to file, thinking it would be bet- 
ter to wait for the expected ascension of a 
climate champion, Vice President Al Gore, 
to the presidency in January 2001. Mendel- 
son filed anyway. 

The EPA sat on the petition until after 
the November 2000 election. When George 
W. Bush took office, the agency’s newly 
instated political appointees fumbled the 
response in a way that made it easier to 
challenge in court. In 2013, 30 parties filed 
suits in the U.S. Court of Appeals for the 


~ 
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The Massachusetts v. EPA decision empowered the EPA to regulate domestic 
greenhouse gases. 


District of Columbia, but the court dis- 
missed them in a 2-to-1 split. 

There are always long odds when one 
seeks to persuade the Supreme Court to 
take a case, but they are even longer when 
a side has legions of squabbling lawyers 
with different legal theories and just one 
chance to file a brief. But Lisa Heinzerling, 
a professor at Georgetown Law School, 
wrote what all agreed was a brilliant brief 
on behalf of the petitioners, and—to every- 
one’s shock—the Supreme Court took the 
case. Then began the fight over who would 
get to argue it. 
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The Rule of Five: : THe RULE 
Making Climate History OF PAV Ea 
at the Supreme Court 


Richard J. Lazarus 
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This honor ultimately fell to James Milkey, 
a lawyer in the Massachusetts Attorney Gen- 
eral’s Office who had never argued before the 
Supreme Court. As they had done with Men- 
delson, many other lawyers working on the 
case tried to get Milkey to step aside, claiming 
that he was not up to the task. 
But Milkey persisted, and—with 
the help of additional brilliant 
briefs led by Heinzerling—on 
argument day, he shone. In the 
months that followed, Justice 
Stevens found ways to persuade 
his colleague Anthony Kennedy 
to give him the crucial fifth vote 
needed to prevail. 

Lazarus walks readers 
through all of the procedural 
steps and legal theories that 
surrounded this case, using lu- 
cid prose that is easy for non- 
lawyers to follow. The book is 
a master class in how the Su- 
preme Court works and, more 
broadly, how major cases navi- 
gate through the legal system. 

The Massachusetts v. EPA 
decision was the basis for 
almost all of the Obama ad- 
ministration’s actions on cli- 
mate change, which included 
stricter regulations of emis- 
sions from motor vehicles and 
power plants. These actions, in 
turn, gave President Obama the legitimacy 
to press fellow world leaders to reach a 
landmark agreement on climate change in 
Paris in December 2015. However, in Febru- 
ary 2016, the Supreme Court put Obama’s 
Clean Power Plan on hold, and President 
Trump has pressed to repeal it, to weaken 
the motor vehicle rule, and to pull the 
United States out of the Paris Agreement. 
As Lazarus rightly concludes, the most im- 
portant legal decisions are made not in the 
courtroom, but at the ballot box. 


10.1126/science.aba8125 


sciencemag.org SCIENCE 


PHOTO: FENG CHENG/SHUTTERSTOCK.COM 


PHOTO: ROBERT M. KOOYMAN 


; : = e 
CH me ; = aA 


Edited by Jennifer Sills 


Protect Australia’s 
Gondwana Rainforests 


Recent fires in the Gondwana Rainforests 
of eastern Australia, a UN Educational, 
Scientific and Cultural Organization 
(UNESCO) World Heritage site (7), exposed 
the country’s inability to protect these 
unique forests. Rich in diversity and global 
fossil heritage, the Gondwana Rainforests 
harbor the highest concentrations of 
threatened species in subtropical south- 
east Queensland and northern New South 
Wales, and they protect more than 40 mil- 
lion years of globally significant rainforest 
evolutionary history (2-9). Australia must 
take steps to ensure that these forests will 
not be lost in future natural disasters. 
Australia began to break from Gondwana 
(Antarctica) around 40 million years 
ago (3), carrying its remnant of Austral 
paleorainforest and deep time evolutionary 
history. On its journey north, the Australian 
continent escaped the ravages of Antarctic 
freezing, only to heat up and dry out much 
later as it approached the tropics (4, 5). 
The Paleo-Antarctic rainforest lineages 
(PARLs) retreated to the few remaining, 
geographically restricted and still shrink- 
ing wet places (2, 4). PARLs are living plant 
taxa with fossil records in the mid-high 
latitude paleorainforests of the Cretaceous 
and Paleogene Southern Hemisphere (2, 
3), and their existence in Australia’s living 
World Heritage Gondwana Rainforests 
preserves a vestige of the mostly vanished, 
late-Gondwana rainforest ecosystems that 
once covered much of the southern half of 
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the planet (4, 6—9). The greatest concentra- 
tion of threatened rainforest plant species 
in Australia, many of them PARLs, now 
resides in the Nightcap Range in northern 
New South Wales (10). Efforts to protect this 
incredible concentration of diversity have 
proven inadequate (77). 

Although the fires are now under 
control, more than 50% of the Gondwana 
Rainforests were affected by them (11). 
Australia must learn from this experience. 
Before the next threat arrives, Australian 
state governments should identify natural 
assets that are at risk and put policies in 
place to improve fire planning, streamline 
fire responses, and protect these forests. 
Legislation should prioritize areas of high 
conservation value, implement strategic 
fire planning to protect the forests, and 
avoid implementing broad-area indus- 
trial-scale controlled burns. Meanwhile, 
scientists must work to generate baseline 
scientific data for these areas immediately, 
which can be used to assess the damage 
from the recent fires and better predict fire 
behavior in the future. If policy-makers 
fail to heed the lessons of the recent fires, 
the world could lose Australia’s ancient 
Gondwana Rainforests. 
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Post-2020 goals overlook 
genetic diversity 


In January, the secretariat of the 
Convention on Biological Diversity (CBD) 
released the first draft of a post-2020 
global biodiversity framework with goals 
and targets for biodiversity (1, 2). We are 
deeply concerned that the goal suggested 
for genetic diversity—the basic element 
for evolutionary processes and all biologi- 
cal diversity—is weak. Abundant scientific 
evidence recognizes the crucial role of 
intraspecific genetic diversity in ecosystem 
resilience, species survival, and adaptation, 
especially under increased threats of cli- 
mate change, habitat loss, and diseases (3). 
The new goals should correct omissions in 
the previous strategy document. 

The previous biodiversity strategy, CBD 
2011-2020, includes Aichi Target 13 on 
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A Royal Society Summer Science Exhibition attendee learns about the helium atom microscope. 
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Outreach that sticks 


| stood in our booth at the Royal Society's 215th Summer Science Exhibition, eagerly 
awaiting a chance to share with the attendees the hands-on activities describing the 
physics of our cutting-edge research—the result of a collaboration between two groups 
half a world apart. A group of three teenagers hesitantly approached the exhibit, 
nervous as to whether to engage. “Would you like to try our challenge?” | asked. “Match 
the pictures of the object to the image from our microscope.’ Immediately, their appre- 
hension vanished, and they set about the task. 
| watched as they matched macroscopic images of a honeybee and a spider to a 
close-up of the hairs on the bee's eye and the fang in the spider’s tiny mouth. When the 
students finished analyzing the images, | explained that the close-up photographs were 
created using a novel helium atom microscope, which shows 
aa extraordinary details at micrometer scale without damaging 
Call for submissions : : 

; : the sample. By then, they had overcome their preconceptions 
Outside the Tower is ee : : : i 
anogcasional featire about the difficulty of physics, and their questions flowed: “How 
highlighting scientists’ ee these images mle “Why is this microscope different?” 

What can it be used for?” Their natural scientific curiosity and 


advocacy experiences. : ee 
Submit your advocacy confidence had been revitalized. 
story at http://cts. Our exhibit accomplished our mission for the day—to engage 


an audience ranging from young children to distinguished 
Fellows of the Society. However, it was only the first step to life- 
long learning in science, technology, engineering, and mathematics (STEM). Although 
one-off interventions can inspire, the lack of participation in STEM will not be solved 
just by increasing interest (1). 

To boost the long-term effects of our outreach efforts, we sent students and teach- 
ers on their way with curriculum-linked resources provided through an online program 
(2). With hints, videos, and teacher support available online, they could solve problems 
related to the exhibition and make connections between science lessons in school 
and a career in research. We hope that our model of inspiration followed by long-term 
engagement will allow students to experience the euphoria of success and the resil- 
ience needed to become the researchers of the future. 
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genetic diversity, which focuses on “culti- 
vated plants and farmed and domesticated 
animals” and their wild relatives. Indicators 
associated with Target 13 follow trends, 
number, and threat status of domestic 
animal breeds and crops (4). Although the 
post-2020 CBD draft includes a much- 
needed goal to maintain genetic diversity, 
it does not explicitly state that genetic 
diversity maintenance is crucial for all spe- 
cies, not just a few. Because no indicators 
to follow trends of genetic diversity of wild 
animals and plants are suggested in the 
draft, genetic diversity could continue to be 
considered only for domestic organisms, as 
it was under Target 13. 

The newly proposed framework should 
incorporate several revisions before it is 
finalized. The post-2020 framework should 
explicitly commit to maintaining genetic 
diversity within all species and to imple- 
menting strategies to halt genetic erosion 
and preserve adaptive potential of popula- 
tions of both wild and domesticated species. 
The framework should also define indica- 
tors of progress toward this goal (5). Such 
indicators could include collecting data 
on the number of species, populations, or 
metapopulations that are large enough to 
maintain genetic diversity as well as those 
that are not. A widely used measure in this 
context is the “genetically effective popula- 
tion size,” which quantifies the rate at which 
a population loses genetic variation. When 
the effective size is measured as 500 “ideal 
individuals,’ the population is considered 
“genetically safe” (6, 7). We therefore sug- 
gest monitoring the number of populations 
above and below the genetically effective 
size of 500. The effective size is assessed 
from genetic or demographic data and is 
usually much lower—by about an order of 
magnitude (8)—than the total number of 
mature individuals. Another indicator could 
be the number of species or populations in 
which genetic diversity is being monitored 
by national agencies or universities using 
DNA-markers. A third indicator could be 
measuring rates of loss of distinct popula- 
tions within species. 

It is encouraging that the CBD post-2020 
draft includes genetic diversity in one of 
the five main goals. However, including 
explicit protection for genetic diversity 
in wild as well as domestic species, and 
strategies to measure the effectiveness of 
efforts toward that goal, will ensure that 
signatories prioritize this important aspect 
of biodiversity conservation. 
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SOLAR CELLS 


Edited by Michael Funk 


Growing perovskite on textured silicon 


ide—band gap perovskites could boost the efficiency of 
silicon solar cells by forming tandem cells, but usually the 
perovskite must be grown on a smoothed side of the silicon 
cell because the material grown on the rough light-trapping 
side often does not fully coat the silicon surface and its 
rough texture is prone to phase separation. Hou et al. grew thick 
films of a perovskite with a band gap of ~1.68 electron volts and 


Gauge invariance 
with cold atoms 


There is considerable inter- 

est in developing quantum 
computational technologies 
that can simulate a series of 
physical phenomena inacces- 
sible by classical computers. 
Mil et al. propose a modular 
scheme for quantum simulation 
of a U(1) lattice gauge theory 
based on heteronuclear spin- 
changing collisions in a mixture 
of two bosonic quantum gases 
isolated in single wells of a one- 
dimensional optical lattice. They 
engineered the elementary 
building block for a single well 
and demonstrate its reliable 
operation that preserves the 
gauge invariance. The potential 
for scalability of the proposed 
scheme opens up opportuni- 
ties to address challenges 
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in quantum simulating the 
continuum limit of the gauge 
theories. —YS 

Science, this issue p. 1128 


Clearing a path for 
an HIV cure 


Curing HIV is more feasible if we 
better understand the forma- 
tion of the latent viral reservoir. 
Leyre et al. studied longitudi- 
nal blood and tissue samples 
obtained from individuals during 
acute HIV infection. The large 
pool of infected cells present 

at the early stages of infection 
did not survive antiretroviral 
treatment. Infected cells at later 
stages were more prevalent in 
lymphoid tissues than in blood. 
These results show that most 
infected targets are cleared 

with early treatment, but 
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Hage et al., p. 1124 


used a passivant, 1-butanethiol, to limit its phase separation. The 
tandem cells had a certified power conversion efficiency of 
25.7% and had negligible losses after 400 hours of operation. 


—PDS Science, this issue p. 1135 


Cross-sectional scanning electron microscope image of textured crystalline 
silicon, which can form the bottom cell in perovskite-silicon tandem solar cells 


HIV-positive cells that remain 
after a certain stage are more 
likely to persist. These persis- 
tent cells are the ones that will 
need to be eradicated for a cure 
to be achieved. —LP 


Sci. Transl. Med. 12, eaav3491 (2020). 


Expanding the human 
proteome 


Using mass spectrometry, 
ribosome profiling, and several 
CRISPR-based screens, Chen 
et al. identified hundreds of 
previously uncharacterized 
functional micropeptides in 
the human genome (see the 
Perspective by Wei and Guo). 
Protein translation outside of 
annotated open reading frames 
(ORFs) in messenger RNAs and 
within ORFs in long noncoding 
RNAs is pervasive. A functional 
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screen using CRISPR-Cas9 
with single-cell transcriptomics 
suggested critical roles for 
hundreds of micropeptides. 
Micropeptides encoded by 
multiple short, upstream ORFs 
form stable protein complexes 
with the downstream canonical 
proteins encoded on the same 
messenger RNAs. —SYM 
Science, this issue p. 1140; 
see also p. 1074 


RNA life span at 
single-cell resolution 


RNA transcripts are an eas- 
ily accessed representation of 
gene expression, but we lack a 
comprehensive view of the life 
span of RNA within the single 
cell. Battich et al. developed 

a method to sequence mes- 
senger RNA labeled with 
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5-ethynyl-uridine (EU) in single 
cells (ScEU-seq), which allows 
estimation of RNA transcrip- 
tion and degradation rates. 
When examining intestinal 
organoid cells, scEU-seq data 
can be used to discern between 
transcription and degradation 
during development, indicat- 
ing that this method can be 
applied to better understand 
the relationship between gene 
expression and RNA degrada- 
tion during development. —LMZ 
Science, this issue p. 1151 


CELL SURFACE MAPPING 


Pinpointing proteins 
To develop drugs that target a 
specific cell surface protein, it’s 
helpful to know which other pro- 
teins reside in its vicinity. Geri et 
al. report a light-triggered label- 
ing technique that improves the 
spatial resolution for this type of 
mapping. Specifically, they rely 
on a photocatalyst with a very 
short energy-transfer range to 
activate a carbene-based label 
that can only diffuse a short dis- 
tance in water before reacting. 
They showcase the technique by 
mapping the environment of the 
programmed-death ligand 1 (PD- 
L1) protein on B cell surfaces, a 
system of considerable interest 
in cancer immunotherapy. —JSY 
Science, this issue p. 1091 


NEURODEVELOPMENT 
Diversity from 
development 


When given a line to follow, some 
fruit flies do so carefully and 
others weave. Linneweber et al. 
now show that these behaviors 
are stable for an individual but 
diverse in an isogenic population. 


Key to generating individual 
diversity in the population is 
the inherent chaos of normal 
development. A set of neurons 
in the visual system is wired up 
in a variable manner, result- 
ing in brain circuit asymmetry 
unique to each fly that guides 
its line-walking behavior. With 
more asymmetry in its brain 
circuit, a fly is better able to 
orient to the line. —PJH 
Science, this issue p. 1112 


NEUROSCIENCE 
Human brain activity 
during memory 


Animal studies suggest that 
sequence replay of neuronal 
activity may underlie memory 
retrieval and consolidation. 
However, there is no direct 
evidence that the replay of 
spiking activity sequences is 
important for these processes 
in the human brain. Vaz et 
al. simultaneously recorded 
single-unit spikes, local field 
potential, and intracranial 
electroencephalography 
signals in the brain while 
participants performed a 
memory task. Sharp wave 
ripple oscillations in the 
temporal lobe cortex reflected 
bursts of neural spiking, 
and these bursts of spikes 
organized into sequences 
during memory formation. 
These sequences were 
replayed during successful 
memory retrieval. The extent 
of sequence replay during cor- 
rect recall was related to the 
extent to which cortical spik- 
ing activity was coupled with 
ripples in the medial temporal 
lobe. —PRS 

Science, this issue p. 1131 


Behavioral patterns in fruit flies, Drosophila melanogaster, vary by individual. 
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Edited by Caroline Ash 
and Jesse Smith 


IN OTHER JOURNALS 


Model of the light-harvesting complex II trimer structure for membranes 
containing only bilayer-forming lipids 


PHOTOSYNTHESIS 


Bulky lipids for light-harvesting control 


ipid membranes are extraordinarily complex, with many 
different acyl chains, head groups, and proteins contributing 
to their shape, curvature, and fluidity. ln membranes with 
specialized functions, such as the thylakoid membranes 
within chloroplasts, these properties can influence the func- 
tion of the embedded membrane proteins. Tietz et al. measured 
changes in the behavior of proteoliposome-reconstituted light- 
harvesting complex II, a protein that normally gathers light for 
photosystem Il, in the presence or absence of the non-bilayer- 
forming lipid monogalactosyldiacylglycerol. In this assay, the 
lipid stimulated energy quenching, which would divert light 
energy into heat and be photoprotective in high-intensity light. 
The authors suggest that changes in lateral membrane pressure 
caused by the bulky lipids may influence pigment conformations 
within the protein and thus alter the amount of light energy mov- 
ing through the system. —MAF 
J. Biol. Chem. 295, 1857 (2020). 


ORGANIC CHEMISTRY that a sulfur reagent can help fill 


Sulfurous glue for this gap. Specifically, they show 
that isopropyl sulfiny! chloride 
heterocycles can react successively with two 


Palladium catalysts can stitch 
together a wide variety of carbon- 
based aromatic rings, but they 
often get hung up if both rings 
contain nitrogen. Qin et al. report 


Grignard-activated heterocycles, 
such as pyridine rings, and then 
couple them together by elimina- 
tion. The reaction was compatible 
with a wide variety of five- and 
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Dogs have an 
amazing sense Of 
smell, which cai 


be put to use in the 
early detection of 
tree diseases. 


PLANT DISEASE 


Sniffing out disease 


ogs have extraordinary noses and can detect odors at sensitivities well beyond human 
technologies. Gottwald et al. tested the talents of dogs for sniffing out the pandemic and eco- 
nomically severe bacterial disease of citrus trees called huanglongbing. The dogs were first 
trained by positive reinforcement on potted citrus plants. Gradually, the tests were scaled up 
to grids of 10 trees by 10 trees and finally to orchard conditions. The dogs detected 30-day- 
old infections compared with genetic testing that only started to identify infections 3 months after 
insect-vectored transmission. lf two dogs were used, detection accuracies of 100% were achieved, 
not only in commercial orchards but also among a range of plants in residential areas known to be 
foci of infection. By modeling the data, the authors estimated that dogs could save up to 92% of 
the trees in an orchard by early detection followed by removal of individual infected trees. —CA 
Proc. Natl. Acad. Sci. U.S.A. 117, 3492 (2020). 


six-membered rings, including 
thiazoles and azaindoles. —JSY 
Angew. Chem. Int. Ed. 10.1002/ 
anie.201915425 (2020). 


PHYSIOLOGY 
Humans on a chip 


Organ chips are devices in which 
epithelial and/or stromal cells are 
grown in achamber filled with 
medium connected by a porous 
matrix-like membrane to another 
chamber lined with endothelial 
cells and containing a blood 
substitute. Depending on the cell 
types used and the contents of 
the blood substitute and medium, 
these chips can model organ- 
specific environments. Novak et 
al. devised an automated system 
to connect up to 10 different 
organ chips with a common blood 
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substitute to model physiological 
systems. In a series of reports, 
they show that this system could 
be used to predict drug dynamics 
and drug toxicity. These devices 
have numerous potential applica- 
tions, including modeling complex 
environments and disease devel- 
opment. —GKA 
Nat. Biomed. Eng. 10.1038/s41551- 
019-0497-x, s41551-019-0498-9, 
s41551-019-0495-z (2020). 


SCIENTIFIC WORKFORCE 


It takes a village 

STEM Ph.D. students often select 
a lab based on the reputation of 
the principle investigator (PI), 
who mentors students as they 
develop research skills. Feldon et 
al. sought to empirically deter- 
mine how PI mentorship affects 
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students’ skill development ina 
long-term study involving 336 
Ph.D. students in the biologi- 
cal sciences. Specifically, skills 
including establishing testable 
hypotheses, experimental design, 
and data analysis were measured 
using writing samples and addi- 
tional qualitative data that were 
collected. Results showed that 
the mentoring activities of Pls do 
not significantly predict students’ 
skill development trajectories. In 
fact, Ph.D. students were more 
than four times as likely to have 
positive skill development when 
engaging with postdocs, sup- 
porting the idea that “cascading 
mentorship” is an effective and 
critical element of Ph.D. training. 
—MMc 
Proc. Natl. Acad. Sci. U.S.A. 116, 20910 
(2019). 
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MICROBIAL GENOMICS 
Closing the loop on 
sequencing 


Sequencing of microorganisms 
in the context of metagenomes, 
such as human microbiomes, 
can be difficult. It is not only 
the diversity of species but also 
the occurrence of repeated 
sequences, such as trans- 
posable elements, that are 
challenging. Furthermore, long- 
read sequencing often requires 
large amounts of DNA, which 
can be difficult to obtain from 
a microbiome. To assemble 
full bacterial genomes for 
analysis of the role of repeats 
in bacterial genomes, Moss et 
al. used nanopore sequencing, 
assembly, and error correc- 
tion to extract closed bacterial 
genomes from microbiomes. 
From this, they were able to 
resolve difficult-to-assemble 
bacterial genomes. —LMZ 
Nat. Biotechnol. 10.1038/ 
$41587-020-0422-6 (2020). 


IMMUNOLOGY 
Wisdom of the crowd 


Quorum sensing is used by 
bacteria to sense local cell 
density in order to regu- 
late and coordinate a wide 
array of functions, including 
biofilm formation. Muldoon 
et al. report that a somewhat 
analogous process may play a 
role in macrophage responses. 
Macrophages exposed to 
bacterial lipopolysaccharide— 
a potent immune stimulant— 
showed a bimodal activation 
profile that could be tuned by 
adjusting cell density before 
stimulation. The authors then 
used dynamical modeling and 
single-cell tracking to propose 
a mechanism they termed 
quorum licensing. In bacterial 
quorum sensing, all players are 
uniformly activated. By con- 
trast, quorum licensing entails 
the transformation of “digital” 
single-cell decisions into “ana- 
log” population-wide outputs. 
This phenomenon may help 
macrophages limit inflamma- 
tory responses to local sites of 
injury and infection. —STS 

Nat. Commun. 11, 878 (2020). 
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DEVELOPMENT 
Paths to understanding 
breastmilk 


Breastfeeding is important for 
infant and maternal health. 
Some of the mechanisms that 
account for the benefits of 
breastmilk include improved 
nutritional and immune status, 
but much is still unknown about 
how breastmilk confers health 
benefits. In a Perspective, 
Bode et al. discuss paths to 
understanding the mother- 
breastmilk-infant triad. They 
focus on how improved knowl- 
edge of the triad could inform 
research on two key issues 
for infant health: childhood 
undernutrition and necrotizing 
enterocolitis, a fatal gastroin- 
testinal disorder that affects 
preterm infants. —GKA 
Science, this issue p. 1070 


GERM CELL DEVELOPMENT 
Commencing oogenesis 


In mice, embryonic stem cells 
and induced pluripotent stem 
cells have been shown to dif- 
ferentiate into primordial germ 
cell-like cells that can give rise 
to functional oocytes. In this 
system, Nagaoka et al. identified 
the gene Zgip1 as anecessary 
and sufficient factor for confer- 
ring the oogenic fate to sexually 
undetermined germ cells. As a 
downstream effector of bone 
morphogenetic protein signal- 
ing, conserved transcriptional 
regulator ZGLP1 activates the 
oogenic program repressed by 
Polycomb activities, whereas 
retinoic acid signaling assists the 
maturation of such activation 
and also the repression of the 
primordial germ cell program. 
This study thus refines our 
understanding of mammalian 
oogenic fate determination. 
—BAP 

Science, this issue p. 1089 
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INTEGRATIVE OMICS 
Mapping the 
mammalian brain 


The diverse physiology of the 
brain is reflected in its complex 
organization at regional, cellular, 
and subcellular levels. Sjostedt 
et al. combined data—both 
newly acquired and from other 
large-scale brain mapping 
projects—from transcriptomics, 
single-cell genomics, in situ 
hybridization, and antibody- 
based protein profiling to map 
the molecular profiles in human, 
pig, and mouse brain. The 
analysis is consistent with a con- 
served basic brain architecture 
during mammalian evolution, 
but it does show differences 
in regional gene expression 
profiles. —VV 

Science, this issue p. 1090 


NEUROSCIENCE 
A major psychosocial 
stress circuit 


Psychological stress induces 
various physiological responses 
by activating the sympathetic 
nervous system. The brain cir- 
cuits involved in these functions 
are still not completely under- 
stood. Ina rat model, Kataoka et 
al. combined anatomical tracing, 
immediate early gene expres- 
sion analysis, pharmacology, 
optogenetics, electrophysiol- 
ogy, and genetic cell ablation 
to provide evidence for the 
prominent role of a ventral part 
of the medial prefrontal cortex in 
sympathetic responses to social 
defeat stress. This brain region 
sends excitatory projections to 
the dorsomedial hypothalamus 
as a central coordinator of the 
psychosocial stress responses. 
This pathway is crucial for 
understanding how psychosocial 
stress influences a variety of 
body functions. —PRS 

Science, this issue p. 1105 
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LIQUID CRYSTALS 
Watching defects 
flow and grow 


Orientational topological defects 
in liquid crystals, known as 
disclinations, have been visual- 
ized in polymeric materials or 
through mesoscale simulations 
of the local orientation of the 
molecules. Duclos et al. report 
the experimental visualization 
of the structure and dynamics 
of disclination loops in active, 
three-dimensional nematics 
using light-sheet microscopy 
to watch the motion of nematic 
molecules driven by the motion 
of microtubule bundles (see 
the Perspective by Bartolo). 
This setup makes it possible to 
directly watch the nucleation, 
deformation, recombination, and 
collapse of spatially extended 
topological defects in three 
dimensions. —-MSL 

Science, this issue p. 1120; 

see also p. 1075 


MICROBIOLOGY 
Porin’ through the wax 


Mycobacterium tuberculosis 
has a distinctive physiology that 
allows it to persist in the human 
body, including a wax-like cell 
coat that is largely imperme- 
able and can resist attack by 
host immune response effec- 
tors. Wang et al. identified a 
simple molecule that effectively 
crosses this cell coat and kills 
M. tuberculosis cells. Whole- 
genome sequencing of mutants 
resistant to this molecule, 3,3-bis- 
di(methylsulfonyl)propionamide, 
showed scattered mutations ina 
protein called PPE51, and these 
mutants resulted in a spec- 
trum of nutrient-related growth 
defects. Experiments suggest 
that PE/PPE family proteins 
are small molecule—selective 
channels analogous to outer 
membrane porins, which allow M. 
tuberculosis to take up nutrients 
while maintaining an otherwise 
impermeable barrier. —CA 
Science, this issue p. 1147 
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CANCER 
Palmitoylation makes 
the switch for EGFR 


Aberrant epidermal growth 
factor receptor (EGFR) signaling 
through two important path- 
ways, RAS—mitogen-activated 
protein kinase (RAS-MAPk) and 
phosphatidylinositol 3-kinase 
(PI3K), promotes tumor growth. 
EGFR inhibitors are rarely effec- 
tive in tumors with mutations 
in the protein KRAS, which 
stimulate the MAPK pathway 
independently of EGFR activ- 
ity. Knarbanda et al. found 
that EGFR palmitoylated on 
the intracellular tail interacted 
preferentially with a PI3K subunit 
rather than a MAPK adaptor 
protein. Blocking palmitoylation 
reduced PI3K signaling activ- 
ity and sensitized cells to PI3K 
inhibitors. These results explain 
why EGFR inhibitors are effective 
only in cancer cells in which 
EGER is palmitoylated. 
—LKF 

Sci. Signal. 13, eaax2364 (2020). 


THYMUS 


PAX] in the thymus 


Severe combined immunode- 
ficiency encompasses a wide 
spectrum of genetic disorders. 
Yamazaki et al. studied immune 
deficits in six patients with 
otofaciocervical syndrome 

type 2, a genetic abnormality 
attributed to biallelic mutations 
in the transcription factor PAX1. 
In addition to immunodeficiency, 
the disease is also characterized 
by facial dysmorphism, hearing 
loss, and skeletal abnormalities. 
The patients were subjected 

to hematopoietic stem cell 
transplantation to rectify their 
immunodeficiency. Despite 
successful engraftment in three 
of the patients, all three failed 
to develop T cells. By generat- 
ing patient-derived induced 
pluripotent stem cells and 
differentiating them ex vivo into 
thymic epithelial progenitor 
(TEP) cells, the authors found 
that PAX1 plays an important 
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role in regulating the develop- 
ment of TEP cells. —AB 
Sci. Immunol. 5, eaax1036 (2020). 


SOLAR CELLS 
Tuning band gaps 
with three halides 


Tandem solar cells can boost 
solar cell efficiency by using two 
active layers to absorb the solar 
spectrum more completely, 
provided that the two cells are 
current-matched. Inorganic- 
organic perovskites tuned to 
the appropriate wide band gap 
(~1.7 electron volts) as top cells 
that contained iodine and bro- 
mine or bromine and chlorine 
have short carrier diffusion 
lengths and undergo photo- 
induced phase segregation. 
Xu et al. now report a method 
for incorporating chloride that 
allows for fabrication of stable 
triple-halide perovskites with a 
band gap of 1.67 electron volts. 
Two-terminal tandem silicon 
solar cells made with this mate- 
rial had a power conversion 
efficiency of 27%. —PDS 
Science, this issue p. 1097 


ELECTRON MICROSCOPY 
Seeing single silicon 
atom vibrations 


Vibrational spectroscopy can 
achieve high energy resolu- 
tion, but spatial resolution of 
unperturbed vibrations is more 
difficult to realize. Hage et 
al. show that a single-atom 
impurity in a solid (a silicon 
atom in graphene) can give 
rise to distinctive localized 
vibrational signatures. They 
used high-resolution electron 
energy-loss spectroscopy ina 
scanning transmission electron 
microscope to detect this signal. 
An experimental geometry was 
chosen that reduced the relative 
elastic scattering contribution, 
and repeated scanning near 
the silicon impurity enhanced 
the signal. The experimental 
vibration frequencies are in 
agreement with ab initio calcula- 
tions. —PDS 

Science, this issue p. 1124 
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GERM CELL DEVELOPMENT 


ZGLP1 is a determinant for the oogenic fate in mice 


So |. Nagaoka, Fumio Nakaki, Hidetaka Miyauchi, Yoshiaki Nosaka, Hiroshi Ohta, 
Yukihiro Yabuta, Kazuki Kurimoto, Katsuhiko Hayashi, Tomonori Nakamura, 


Takuya Yamamoto, Mitinori Saitou* 


INTRODUCTION: Mammals generate sexually di- 
morphic germ cells, oocytes and spermatozoa, 
which form the basis for sexual reproduction. 
Studies in mice have shown that in embryonic 
ovaries, retinoic acid (RA) and its downstream 
effector, STRAS8, induce sexually uncommitted 
germ cells into the oogenic fate, whereas in 
embryonic testes, RA is degraded by Sertoli 
cells and other mechanisms elicit the sper- 
matogenic pathway. Conversely, our recent 
study has shown that RA and STRAS8 are not 
sufficient to induce in vitro mouse primordial 
germ cell-like cells (mPGCLCs) into the oogenic 
fate. Instead, bone morphogenetic protein 
(BMP), which is expressed in granulosa cells, 
and RA synergistically confer the oogenic path- 
way on mPGCLCs. This finding creates an 
experimental framework for a systematic un- 
derstanding of the mechanism of oogenic fate 
determination. 


RATIONALE: Using mouse embryonic stem cells 
(mESCs) as a starting material, our in vitro 
system reconstitutes PGC specification, epige- 
netic reprogramming, and oogenic fate de- 
termination under a defined condition. We 
reasoned that this system should serve as a 
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powerful platform with which to analyze the 
mechanism of germ cell development, includ- 
ing that for oogenic fate determination. Here, 
we set out to identify a transcriptional regula- 
tor that functions downstream of BMP signal- 
ing for oogenic fate determination by screening 
with RNA sequencing and to explore the regu- 
lation and function of a key candidate using 
both the in vivo model and the in vitro sys- 
tem through loss- or gain-of-function experi- 
ments. We reasoned that these experiments 
should clarify the distinctive functions of 
the BMP and RA pathways in oogenic fate 
determination. 


RESULTS: We identified Zglp1, which encodes 
an evolutionary conserved transcriptional reg- 
ulator with GATA-like zinc fingers, as a gene 
that showed specific up-regulation in mPGCLCs 
in response to BMP, but not to RA. In females, 
ZGLP1 showed specific and transient expression 
in embryonic germ cells during oogenic fate 
determination [from embryonic day (E) 12.0 to 
E15.5], whereas in males, it was not detected in 
embryonic germ cells undergoing spermato- 
genic fate determination but was expressed in 
postnatal undifferentiated or differentiating 


mPGCLCs + ZGLP1 


ZGLP1 induces the oogenic fate in mice. Images show immunofluorescence staining of SYCP3 (yellow) 
and DDX4 (magenta) expression in E15.5 oocytes (left) and mPGCLCs in vitro overexpressing ZGLP1 

(right). ZGLP1 alone, which acts downstream of the BMP signaling, is sufficient to induce mPGCLCs into 
oocyte-like cells in the meiotic prophase with SYCP3-positive synaptonemal complexes. Scale bar, 10 um. 
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spermatogonia. In females, Zg/p1 was essen- 
tial for the oogenic fate determination, with 
no germ cells differentiating into oocytes in 
the meiotic prophase in Zglp1-knockout mice, 
whereas in males, it was dispensable for germ 
cell sex determination but was required in 
the spermatogonia for an efficient completion 
of meiotic prophase. ZGLP1 overexpression 
replaced the functions of the BMP signal- 
ing and induced mPGCLCs into the oogenic 
fate and meiotic prophase with up-regulation 
of Stra8 in the absence of RA. We found that 
the key role of RA signaling was to contrib- 
ute to maturation of the overall oogenic pro- 

gram and to repression of 
the PGC program. Con- 
sistently, Zglp1 showed 
a broad regulatory cover- 
age of the transcriptome 
for the oogenic program, 
which included the pro- 
gram for RNA processing, transcription and 
chromatin modification, retrotransposon reg- 
ulation, meiosis, and oocyte development, 
whereas Stra8, which functioned in part down- 
stream of ZglpI, mainly regulated the meiotic 
program. Chromatin immunoprecipitation se- 
quence analysis suggested that ZGLP1 pref- 
erentially activates key genes repressed by 
Polycomb activities in sexually uncommitted 
germ cells. 


Read the full article 
at http://dx.doi. 
org/10.1126/ 
science.aaw4115 


CONCLUSION: Our results demonstrate that 
BMP signaling and ZGLP1 play a central 
role in establishing the key programs for 
the oogenic fate, with RA signaling assist- 
ing in the maturation of these programs, 
including repression of the early PGC pro- 
gram. These findings provide an integrated 
paradigm for mammalian oogenic fate de- 
termination. The distinct function of ZGLP1 
between females and males represents a 
sexual dichotomy of the mechanism for the 
onset and progression of the oogenic and 
spermatogenic processes, including meiosis. 
A better understanding of the mechanism 
of sex determination in germ cells should 
serve as a key, not just for further promot- 
ing in vitro gametogenesis (IVG) studies, 
including human IVG, but also for delin- 
eating the etiology of critical diseases such 
as infertility and genetic and epigenetic dis- 
orders of offspring. Furthermore, an in vitro 
system that induces meiotic recombination 
in a physiologically relevant context propels 
future studies to understand the mechanisms 
for generating genetic diversity. 


The list of author affiliations is available in the full article online. 
*Corresponding author. Email: saitou@anat2.med. 
kyoto-u.ac.jp 
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GERM CELL DEVELOPMENT 


ZGLP1 is a determinant for the oogenic fate in mice 


So |. Nagaoka’, Fumio Nakaki2*, Hidetaka Miyauchi*+, Yoshiaki Nosaka’, Hiroshi Ohta’, 
Yukihiro Yabuta’?, Kazuki Kurimoto*+, Katsuhiko Hayashi2§, Tomonori Nakamura”, 


Takuya Yamamoto"***, Mitinori Saitou’*>| 


Sex determination of germ cells is vital to creating the sexual dichotomy of germ cell development, thereby 
ensuring sexual reproduction. However, the underlying mechanisms remain unclear. Here, we show that 
ZGLP1, a conserved transcriptional regulator with GATA-like zinc fingers, determines the oogenic fate in mice. 
ZGLP1 acts downstream of bone morphogenetic protein, but not retinoic acid (RA), and is essential for 

the oogenic program and meiotic entry. ZGLP1 overexpression induces differentiation of in vitro primordial 
germ cell-like cells (PGCLCs) into fetal oocytes by activating the oogenic programs repressed by Polycomb 
activities, whereas RA signaling contributes to oogenic program maturation and PGC program repression. 
Our findings elucidate the mechanism for mammalian oogenic fate determination, providing a foundation for 
promoting in vitro gametogenesis and reproductive medicine. 


n mammals, the chromosomes specify the 

sex of somatic cells, which in turn activates 

the oogenic or spermatogenic program in 

developing germ cells. This manifests at 

around embryonic day (E) 12.5 in mouse 
germ cells that are colonized in embryonic 
ovaries or testes (1). A prevalent hypothesis 
posits that in females, retinoic acid (RA) in- 
duces the expression of STRA8, which acts as a 
key regulator for germ cells (oogonia) to adopt 
the oogenic fate and enter into the meiotic 
prophase, whereas in males, RA is degraded 
by CYP26B1 in embryonic Sertoli cells, and 
germ cells (prospermatogonia) ensheathed 
by such cells enter the spermatogenic pathway 
(2-4). However, in a recent in vitro analysis, we 
showed that RA and the expression of STRA8 
are not sufficient to induce the oogenic fate in 
mouse primordial germ cell-like cells (mPGCLCs) 
derived from mouse embryonic stem cells 
(mESCs). Instead, bone morphogenetic proteins 
(BMPs), which are expressed strongly in embry- 
onic granulosa cells (5, 6), and RA synergisti- 
cally confer the oogenic pathway and meiotic 
program on mPGCLCs (7). This finding not 
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only defines a signaling principle that drives 
the oogenic pathway, but also creates an ex- 
perimental framework for its systematic anal- 
ysis. Here, we identify a key transcription 
factor (TF) downstream of BMP and estab- 
lish an integrated paradigm for oogenic fate 
determination. 


A system for identifying key TFs for the 
oogenic fate 


In the in vitro system, mPGCLCs isolated by 
fluorescence-activated cell sorting (FACS) for 
the expression of Prdmi-mVenus [also known 
as Blimpl-mVenus (BV)] and Dppa3-ECFP 
[also known as Stella-ECFP (SC)] transgenes 
are cultured in the presence of forskolin and 
rolipram, which elevate intracellular cAMP 
levels and enhance PGC(LC) expansion (8), and 
these cells are provided with BMP and RA from 
culture day 3 (c3) onward (Fig. 1A) (7). Under 
these conditions, mPGCLCs differentiate into 
fetal oocyte-like cells that are positive for SYCP3, 
a key synaptonemal complex component, and 
proceed into meiotic prophase, reaching the 
pachytene stage at c9 (Fig. 1, A and B) (7). BMP 
alone can also induce mPGCLCs into fetal 
oocyte-like cells, albeit with low efficiency 
(Fig. 1, A and B) (7). Using RNA-sequencing 
(RNA-seq) technology (9), we screened genes 
encoding TFs preferentially up-regulated in 
response to BMP (fig. S1A and tables S1 and 
$2). Considering genes known to function in 
meiotic initiation (J0, 11), we selected eight 
genes, Dazl, Gata2, Id1, 1d3, Msx1, Msx2, Stra8, 
and Zglpl, for further analyses (fig. S1, A and B). 

We established a system to overexpress the 
candidates in mPGCLCs in a doxycycline (Dox)- 
dependent fashion (Fig. 1A) (12, 13). Immuno- 
fluorescence (IF) analyses revealed that the 
provision of Dox and RA to the eight-gene 
transfectants from c3 onward leads to the 
induction of SYCP3* fetal oocyte-like cells in 
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most BV" and SC* cells at c9 (Fig. 1, A and B), 
suggesting that one or more of the eight fac- 
tors acts as a BMP effector and, together with 
RA, induces the oogenic fate. The provision of 
Dox alone also induced the SYCP3* cells (Fig. 1, 
A and B). We therefore performed eight sets 
of experiments by providing Dox and RA to 
the seven-gene transfectants produced by with- 
drawing each of the eight genes in turn, which 
revealed that each of the seven-gene combi- 
nations, except that lacking Zglp/, resulted in 
the induction of SYCP3* cells (Fig. 1, A and B). 
Accordingly, Zglp1 overexpression, but not 
overexpression of the other genes, including 
Gata2, IdI, and MszI, and the provision of RA 
resulted in a robust induction of SYCP3* cells 
(Fig. 1, A and B, and fig. $1, C and D). Zglp1 
overexpression without RA also generated 
SYCP3" cells as a substantial fraction of BV* 
and SC* cells (Fig. 1, A and B) (see below). We 
therefore explored the function of Zglp/ in 
oogenic fate determination. 


ZGLP1 is essential for the oogenic fate 


Zglp1 encodes a transcriptional regulator bear- 
ing a GATA-like zinc finger conserved across 
the metazoan phyla (fig. S2) (74, 15). It has been 
reported that Zglp1 is expressed in gonadal 
somatic cells, but not in germ cells, and is es- 
sential both for oogenesis and spermatogenesis 
(14, 15). However, our RNA-seq and IF analyses 
revealed that during the embryonic period, 
Zglp1/ZGLP1 exhibits specific and transient 
expression in DDX4* germ cells in females, but 
not in somatic cells or males, beginning at E12.0 
and waning after E14.5, a key period for the sex 
determination of germ cells (Fig. 2, A and B, 
and fig. SIB). ZGLP1 preceded STRAS in expres- 
sion by a period of >1 day (Fig. 2, A and B). Post- 
natally, Zglp1 showed specific expression in 
Zotb16"-undifferentiated and Kit*-differentiating 
spermatogonia (fig. S3, A to C) (J6). 

We analyzed Zglp1 homozygous knockout 
(Zglpt! ~) mice (fig. S4A) (14, 15). Compared 
with wild-type ovaries, Zglpr! ~ ovaries at post- 
natal day (P) 8 and at 6 weeks were highly 
atrophic with no ovarian follicles. Ovaries as 
early as E17.5 contained a drastically reduced 
number of DDX4" cells (Fig. 2, C and D, and 
fig. S4B). At E15.5, although a substantial num- 
ber of DDX4* cells remained in Zelpr! ~ Ova- 
ries, a majority of them (290%) were negative 
for SYCP3 even though they expressed STRA8 
(Fig. 2, D to F, and fig. S4, C to E; see below for 
the level of Stra8/STRA8 in Zelpr! ~ cells). The 
less prevalent SYCP3* cells did not display fea- 
tures suggestive of meiotic entry, such as telo- 
mere clustering (17) or the expression of DMC1 
(18, 19) or YH2AX (20) (Fig. 2, E and F, and fig. 
S4, F and G). Note that the Zeglpr' ~ female 
germ cells bore more severe phenotypes than 
the Stras”/~ cells, some of which survived to 
form mature ovarian follicles despite their fail- 
ure in meiosis (27). 
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Fig. 1. Identification of a TF inducing the oogenic fate. (A) Left: Top, Scheme 
for the oogenic fate induction by BMP2- or Dox-induced TF overexpression 

with or without RA in mPGCLCs. m220, m220 feeder cells; SCF, stem cell factor: 
FR1O, forskolin and rolipram (10 uM each). Bottom: Vectors for the Dox-inducible 
system. D424, an insulator; EFla, elongation factor lo. promoter; CDS, a coding 
DNA sequence from a candidate gene; IRES, internal ribosome entry site; Neo, 
neomycin-resistance gene; Pac, puromycin-resistance gene; pB-TR, piggyBac terminal 
repeat; rtTA, reverse tetracycline transactivator; tetO0/mCMV, tetracycline operator 


The Zelpr! ~ male germ cells developed nor- 
mally until at least P7 (fig. S5, A and B). Sub- 
sequently, they exhibited severe impairment 
in the first wave of spermatogenesis: unlike in 
females, they entered into meiotic prophase 
but mostly failed to progress beyond the zy- 
gotene stage, resulting in a large fraction of 
abnormal pachytene cells both at P15 and 
P20 (fig. S5, A to F). The Zglpr! ~ adults ex- 
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8 factors—Zglp1 _ 8 factors 8 factors 
(+ Dox) (+Dox+RA) (+ Dox +RA) (+ Dox) 


Zglp1 O.E. 


Zglp1 OE 
(+ Dox + RA) 


hibited severe defects in spermatogenesis 
and were infertile (fig. S5, Gand H) (74, 15). 
Nonetheless, upon aging, a small fraction of 
the Zelpr! ~ cells generated round or elongated 
spermatids, which, with intracytoplasmic sperm 
injection, contributed to apparently healthy off- 
spring (fig. $5, I to O). Thus, in females, Zglp1 
is essential for activating the oogenic program, 
including meiotic prophase, whereas in males, 
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fused to the minimal cytomegalovirus promoter. Right: Percentages of SYCP3* cell 
induction among GFP* (BV* and SC*) mPGCLC-derived cells at c9 under the 
conditions indicated. For each condition, mPGCLC-derived cells were imaged so that 
a minimum of 20 BV* and SC* cells were included per image and at least 10 such 
images were acquired from two to four biological replicates. (B) Representative 
images for the expressions of GFP (BV and SC) (cyan), DDX4/mCherry (magenta), 
and SYCP3 (yellow) in mPGCLC-derived cells at c9 under the conditions indicated. 
Cells were counterstained with DAPI (gray). Scale bars, 10 um. 


it is dispensable for germ cell sex determina- 
tion but is required in the spermatogonia for 
an efficient completion of meiotic prophase. 


ZGLP1 is sufficient to create the foundation of 
the oogenic fate 


Next, we explored the regulation and function 
of Zglp1. We examined the expression of Zglp1 
and other relevant genes by mPGCLC-derived 
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Fig. 2. Expression and requirement of ZGLP1 in oogenic fate determination. 
(A) Expression of ZGLP1 (green) and STRA8 (yellow) in DDX4* (magenta) germ 
cells in fetal ovaries and testes from E11.5 to E15.5. Cells were counterstained with 
DAPI (gray). Yellow arrows indicate ZGLP1* cells in E12.0 fetal ovaries. Scale bars, 
10 um. (B) Percentages of ZGLP1* or STRA8* cells among DDX4" cells per ovarian 
section (see the materials and methods). The numbers in parentheses represent 
the number of embryos analyzed. (C) Ovarian sections of Zgip1*”* and ZglpI~ mice 
at E17.5 immunostained for DDX4 (magenta). Scale bars, 100 um. (D) Numbers 


of DDX4* cells per ovary from Zgip1*’* and ZglpI~’~ mice at E13.5, E15.5, and E17.5 
(see the materials and methods). Two to three embryos were analyzed for each 
genotype at each time point. Black bars represent mean values. NS, not significant. 
(E) Expression of SYCP3 (yellow) in DDX4* (magenta) cells in ovaries of Zglp1*”* 
and Zgipl’’~ mice at £15.5. Scale bars, 10 um. (F) Percentages of DDX4* cells in 
ovaries of Zglp1*”* and ZglpI-’ mice at E15.5 showing distinct SYCP3 staining 

as categorized in fig. S4C. Two embryos for each genotype and at least 250 DDX4* 
cells were analyzed per embryo. Error bars indicate SD. 


cells under four conditions: control culture 
or aculture with BMP2, RA, or both BMP2 and 
RA. Zglp1 was undetectable in the control, 
whereas it was up-regulated as early as 24: hours 
after BMP2 stimulation and elevated fur- 
ther thereafter (fig. S6A). The manner of up- 
regulation of Zglp1 in response to BMP2 was 
similar to that of Jd, a well-known BMP tar- 
get (22) (fig. S6A). There was no up-regulation 
of Zglp/ in response to RA (fig. S6A). Combined 
with the data showing that Zg/p1 up-regulation 
by BMP2 is impaired by LDN193189, a selective 
inhibitor of ALK2/3 receptors (fig. S6B), these 
findings confirm that Zglp1 is a downstream 
target of BMP signaling. Although Stra8 was 
up-regulated as early as 24 hours after RA 
stimulation and elevated further thereafter, it 
was also up-regulated, albeit at lower levels, 
in the control at late stages such as c7 and c9 
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(fig. S6C). Such up-regulation could be an effect 
of genome-wide DNA demethylation, including 
the demethylation of the Stra8 promoter, during 
the mPGCLC culture (8, 23) (fig. S6D). Note that 
BMpP2 enhanced Stva8 and Rec8 expression at 
late stages such as c7 and c9 (fig. S6C), and these 
enhancements could have occurred, at least in 
part, through the action of ZGLP1 (see below). 
By contrast, RA-responsive genes (24) with no 
apparent roles in the oogenic pathway (Cyp26a1, 
Rarb, and Hoxal) were undetectable in the con- 
trol culture or the culture with BMP2 (fig. S6C), 
indicating that there is little, if any, RA activity 
in the culture. 

We cloned mESC lines expressing high (A2), 
middle (C2), and low (H2) levels of Zglpi1, re- 
spectively, upon Dox administration (fig. S7, A 
and B). ZGLP1 overexpression alone in mPGCLCs 


and meiotic entry (Ddx4, Dazl, Stra8, Rec8, 
Sycp3, Spoil, and Dmc1) and induced SYCP3* 
cells in a dose-dependent manner: notably, 
the A2 clone with the highest Zglp1 expres- 
sion activated oogenic and meiotic genes 
robustly and induced as much as ~90% of 
mPGCLCs into SYCP3* cells (fig. S7, C to E). 
RA enhanced such activations, particularly 
in the H2 and C2 clones with low and middle 
ZGLP1-overexpression levels, respectively (fig. 
S7, C and E). These findings suggest that 
ZGLP1 is sufficient to induce the oogenic fate, 
whereas RA augments the ZGLP1-activated 
oogenic program. We validated that although 
LDN193186 impaired the BMP-dependent up- 
regulation of oogenic and meiotic genes, it had 
no impact on that by ZGLP!1 (fig. S7F), cor- 
roborating that ZGLP1 is a key BMP effector 


up-regulated key genes for the oogenic fate 
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for the oogenic fate. 
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We examined the consequences of ZGLP1 
overexpression in a culture lacking fetal bovine 
serum and including WIN18,446, an inhibitor 
of aldehyde dehydrogenases and thereby of 
RA biosynthesis from retinol (25), a condition 
under which the presence of RA can be ne- 
gated more formally. Even under this condi- 
tion, ZGLP1 attained a robust induction of 
the SYCP3" cells (fig. S8, A to C). We explored 
the effect of BMS493, an inverse agonist of 
RA receptors (RARs) that enhances the in- 
teraction of RARs with their corepressors (26). 
BMS493 therefore stabilizes a repressive chro- 
matin state near RAR-bound genes, com- 
promising their activation irrespective of the 
presence of RA (26). BMS493 inhibited the 
induction of SYCP3* cells by ZGLP1 to a sub- 
stantial extent; however, even in the presence 
of 1 or 10 uM BMS493, which fully prevents 
the activity of 100 nM RA (fig. S8D), ZGLP1 
induced ~20% of BV* and SC* cells into SYCP3* 
cells and up-regulated the relevant genes (fig. 
S8, E to G). Collectively, these findings indicate 
that ZGLP1 is sufficient to create the foundation 
of the oogenic fate in the absence of RA, that 
this function involves the activation of oogenic 
and meiotic genes both through RAR-dependent 
(BMS-sensitive) and RAR-independent (BMS- 
insensitive) mechanisms, and that RA enhances 
this pathway through RAR-mediated transcrip- 
tional regulation. 


ZGLP1 reconstitutes key transcriptome 
programs for the oogenic fate 


To examine the extent to which ZGLP1 over- 
expression recapitulates the oogenic program, 
we determined the transcriptome dynamics 
associated with ZGLP1 overexpression in 
mPGCLCs (the A2 clone) under the various 
conditions examined thus far (tables S1 and 
$3). Principal component analysis (PCA) re- 
vealed that mPGCLCs with ZGLP1 alone, as 
well as those under a serum-free condition 
with WIN18,446, progressed along the oogenic 
pathway, gaining, at c9, properties similar to 
those of E14.5 and E15.5 fetal oocytes (Fig. 3A 
and fig. S9A). A spread analysis revealed that 
up to 3% of mPGCLCs with ZGLP1 alone pro- 
ceed to the pachytene stage by c9 (Fig. 3, B and 
C). mPGCLCs with ZGLP1 and BMS493 also 
adopted the oogenic program, gaining, at c9, 
more advanced properties than E13.5 fetal 
oocytes (fig. S9B). The addition of RA facili- 
tated the ZGLP1-activated oogenic program, 
with mPGCLCs with ZGLP1 and RA attaining 
a transcriptome similar to that of E14.5 and 
E15.5 fetal oocytes at c7 (Fig. 3A) and up to 14% 
of them proceeding to the pachytene stage by 
c9 (Fig. 3C). 

We defined the early PGC (315 genes: Nanog, 
Sox2, Prdmi, Prdml4, Tfap2c, etc.), late germ 
cell (250 genes: Dazl, Ddx4, Piwil2, Mael, 
Mov1011, etc.), and fetal oocyte (468 genes: 
Stra8, Rec8, Sycp3, Dmcl1, Sycpl1, etc.) genes 
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that characterize the transition of transcription 
profiles for the oogenic program (7). mPGCLCs 
with ZGLP1 (without RA, serum-free with 
WINI18,446) activated late germ cell and fetal 
oocyte genes to an extent comparable to fetal 
oocytes in vivo (fig. S9C), and mPGCLCs with 
ZGLP1 and BMS493 also activated such genes, 
with up-regulation of genes including Stra8 
and Rec8 as early as 24 hours after Dox ad- 
ministration (fig. S9D). Conversely, mPGCLCs 
with ZGLP1 repressed early PGC genes mod- 
erately, and mPGCLCs with ZGLP1 and BMS493 
failed, to a greater extent, to repress them (fig. 
S9C). By contrast, mPGCLCs with ZGLP1 and 
RA activated late germ cell and fetal oocyte 
genes and repressed the early PGC genes in 
a manner parallel to female germ cells in vivo 
(fig. S9C), highlighting the role of RA sig- 
naling in augmenting the activation of the 
oogenic program and repressing the early 
PGC program. 

Compared with the control, mPGCLCs with 
ZGLP1 up-regulated 489 genes, which were 
enriched in gene ontology (GO) terms key to 
the oogenic fate, such as “oogenesis,” “meiotic 
cell cycle,” and “piRNA metabolic process” 
(Fig. 3D). mPGCLCs with BMP2, with ZGLP1 
(serum-free with WIN18,446), and with ZGLP1 
and BMS493 up-regulated a similar number of 
genes (457, 505, and 413 genes, respectively) 
with key GO enrichment (Fig. 3E), and the 
ZGLP1-overexpressing cells showed no up- 
regulation of typical BMP downstream genes, 
including /d1/3 and Ms! (fig. S9, E and F). 
mPGCLCs with ZGLP1 and RA up-regulated 
a larger number of genes (1178), which encom- 
passed nearly all of the genes up-regulated 
in mPGCLCs with ZGLP1 (Fig. 3D). None- 
theless, the genes specifically up-regulated 
above the threshold [log,(fold change) > 2] 
by ZGLP1 and RA (713 genes, with enrichment 
for “cilium assembly,” “protein ubiquitination/ 
phosphorylation,” and “gametogenesis”) were 
up-regulated, albeit to a lesser extent, by ZGLP1 
alone (fig. S9G). 

mPGCLCs with ZGLP1 down-regulated 294 
genes, which were enriched in GO terms such 
as “regulation of transcription” and “glycolytic 
process” (Fig. 3F). Consistently, mPGCLCs with 
BMP2, with ZGLP1 (serum-free with WIN18,446), 
and with ZGLP1 and BMS493 down-regulated 
a similar number of genes (107, 187, and 185 
genes, respectively) (Fig. 3G). mPGCLCs with 
ZGLP1 and RA down-regulated 1063 genes, 
which encompassed nearly all of the genes 
down-regulated in mPGCLCs with ZGLP1 (Fig. 
3F). The genes specifically down-regulated 
above the threshold [log,(fold change) > 2] by 
ZGLP1 and RA (795 genes, with enrichment 
for “metabolic process,” “stem cell mainte- 
nance,” and “G1/S transition of mitotic cell 
cycle”) were down-regulated only to a small 
extent by ZGLP1 alone (fig. S9H). Collectively, 
these findings demonstrate that the BMP sig- 
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naling and ZGLP1 play a central role in es- 
tablishing the key programs for the oogenic 
fate, with RA signaling assisting its matura- 
tion, including the repression of the early PGC 
program. 


STRA8 activates meiotic genes in a 
ZGLP1-dependent context 


To compare the role of Zglp1 with that of Stva8, 
we generated Zglp1 and Stra8 homozygous 
knockout (Zglpr! ~ and Stras”, respectively) 
mESCs (fig. $10, A to D). At least two Zglpr/~ 
and two Stras/~ independent clones gave es- 
sentially identical phenotypes (see below), and 
we present representative data hereafter. The 
Zelpr! ~ mESCs differentiated into apparently 
normal mPGCLCs, which propagated in cul- 
ture in the presence of forskolin and rolipram; 
however, upon stimulation with BMP and RA, 
the Zglpr' ~mPGCLCs were retarded in down- 
regulating BV and SC expression, failed to es- 
tablish premeiotic DNA replication, and failed 
to differentiate into SYCP3* fetal oocyte-like 
cells (fig. S10, E to H). 

The transcriptome analyses revealed that in 
response to BMP2, Zelpr' ~ mPGCLCs failed 
to up-regulate most of the genes that were up- 
regulated in wild-type mPGCLCs except genes 
such as JdI and Jd2 (Fig. 4, A and B). Ac- 
cordingly, Zglpr! ~ mPGCLCs with BMP and 
RA progressed very poorly along the female 
pathway, only gaining properties similar to 
those of ~E12.5 germ cells even at c9 (Fig. 4C). 
Stra8"'/- mPGCLCs with BMP and RA also 
failed to establish premeiotic DNA replication 
and to fully differentiate into fetal oocyte-like 
cells (fig. S10, I and J) (7), yet they progressed 
further than Zglpr' ~ mPGCLCs, acquiring the 
properties of germ cells between E13.5 and 
E14.5 to E15.5 (fig. SIA) (7). Although Zglp1 
expression was unaffected in Stras”!~ cells, 
Stra8/STRAS8 was reduced to less than half in 
Zglpr! ~ cells (Fig. 4, D and E). Furthermore, 
although the addition of RA at higher concen- 
trations (1 and 10 uM) reinforced Stra8 up- 
regulation in Zglpr! ~ cells, it had no impact 
on the up-regulation of other oogenic or meiotic 
genes in these cells (Fig. 4F). To explore whether 
Zglp1 overexpression overcomes the defects 
in Stras’/- cells, we disrupted Stra8 in the A2 
clone (fig. S11, B and C). The mPGCLCs from 
the Stra8”/~ A2 clone with Dox and RA ex- 
hibited phenotypes similar to those of the 
Stra8s~/~ mPGCLCs with BMP2 and RA (fig. 
S11, D and E). Thus, although Zglp/ is essen- 
tial for launching the overall oogenic program, 
Stra8 is an indispensable downstream effector 
of Zglp1 for the meiotic program. 

The number of genes that failed to be up- 
regulated in Zglpr! ~ mPGCLC-derived cells 
with BMP and RA (530 genes) was much greater 
than that in Stras’~ cells (224 genes) (Fig. 4G). 
Although such genes in Stras!- cells, a majority 
of which (199/224 genes, ~89%) also failed to 
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Fig. 3. Induction of the oogenic program by ZGLP1 and its augmentation 
by RA. (A) PCA of the transcriptomes of germ cells in vivo (E9.5 to E11.5 PGCs, 
E12.5 to E15.5 female or male germ cells) and mPGCLCs from Zgip1- 
overexpressing (0.E.) clones cultured with Dox or with Dox and RA. Each plot 
represents one transcriptome. Transcriptomes from two replicates (for E9.5 
PGCs and E14.5 to E15.5 oocytes, three replicates) are plotted. The color code is 
as indicated. (B) Representative images showing the substages of meiotic 
prophase | in ZGLP1 O.E. mPGCLC-derived cells and E15.5 oocytes. Germ cells 
were spread and immunostained for SYCP1 (red), SYCP3 (green), and yH2AX 
(gray). Arrow indicates an unsynapsed X chromosome (this clone is XO) 

(fig. S7B). Scale bars: 5 um for ZGLP1 O.E. cells, 10 um for oocytes. (C) Meiotic 
progression of SYCP3* mPGCLC-derived cells (at c7, c8, and c9) and E15.5 


oocytes (see the materials and methods). The numbers of SYCP3* cells analyzed 
are shown in parentheses above each bar. (D and F) Overlap of up-regulated 
or down-regulated genes between mPGCLCs overexpressing ZGLP1 without or 
with RA. The numbers of the genes up-regulated or down-regulated [logo(RPM + 1) 
> A, loga(fold change) > 2 from the control, either at c4, c5, c7, or c9] are 
shown. Key GO enrichments are shown. (E and G) GO enrichment among genes 
up-regulated or down-regulated in mPGCLC-derived cells and in E15.5 oocytes. 
For mPGCLC-derived cells, the numbers of the genes up-regulated or down- 
regulated [logo(RPM + 1) > 4, logs(fold change) > 2, at c4, c5, c7, or c9] from 
the control culture (clone H18 for BMP2-treated and the Zglp1 O.E. clone A2 
for other treatments) are shown. For E15.5 oocytes, the numbers of genes 
up-regulated or down-regulated from E9.5 PGCs are shown. 


be up-regulated in Zglpr! ~ cells, were mainly 
associated with the processes for the meiotic 
prophase such as “synapsis” and “meiotic cell 
cycle,” the genes that failed to be up-regulated 
exclusively in Zglpr! ~ cells included those with 
wider implications for fetal oocyte develop- 
ment, such as “oogenesis,” “RNA secondary 
structure unwinding,” and “transcription and 
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chromatin modification,” in addition to those 
associated with the meiotic prophase (Fig. 4G). 
Similarly, the number of genes that failed to 
be down-regulated in Zglpr! ~ cells with BMP 
and RA (281 genes) was greater than that in 
Stras'~ cells (152 genes) (Fig. 4H). We con- 
clude that Zglp1 broadly regulates the oogenic 


least in part downstream of Zglp1, mainly reg- 
ulates the meiotic program. 


A mechanistic basis of ZGLP1 function 


We used chromatin immunoprecipitation se- 
quencing (ChIP-seq) to determine the genome- 
wide binding profiles of ZGLP1 in c7 or c8 


program, whereas Stra8, which functions at 
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mPGCLC-derived cells overexpressing V5-tagged 
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Fig. 4. Regulatory coverages of ZGLP1 and STRA8 for the oogenic program. 
(A) Overlap of genes up-regulated by BMP2 between the wild-type and Zglp1~’~ 
mPGCLC-derived cells. The numbers of the genes up-regulated [logo(RPM + 1) > 4, 
logo(fold change) > 2 from the control at c4, c5, c7, or c9] in the respective 
clones are shown. Key GO enrichments are shown. (B) Expression kinetics of the 
457 up-regulated genes (A) in the wild-type and Zgip]-’~ mPGCLC-derived 
cells at c4, c5, c7, and c9. Fold changes from the control culture are shown for 
the respective clones. Box plots represent values for the 25'", 50"", and 75"" 
percentiles. (C) PCA of the transcriptomes of germ cells in vivo (E9.5 to E11.5 
PGCs and E12.5 to £15.5 female or male germ cells) and wild-type and Zgip1~/~ 
mPGCLC-derived cells (G10) cultured with BMP2 and RA from c3. Each plot 
represents one transcriptome. Transcriptomes from two replicates (for E9.5 PGCs 
and E14.5 to E15.5 oocytes, three replicates) are plotted. The color code is as 
indicated. (D) Expression dynamics of Zgip1 and Stra8 in the wild-type, ZglpI-”~, 
and Stra8-’~ mPGCLC-derived cells cultured with BMP2 and RA from c3 onward 
(relative to the wild-type mPGCLC-derived cells at c9). (E) Left: Western blot 


Nagaoka et al., Science 367, eaaw4115 (2020) 6 March 2020 


(224 genes) 


Terb1, Majin, Rad211, Sycp1, Sycp3, Tex15, Ccene2 


Sirt7, Setdb1, Kdm6a, Jmjdic, Phf21a, Smarca5, Kdm4d |: Meioc, Prdm9, Mei1, Ccnb3, Rnf212, Meiob, Msh4 


Dmct 


wildtype < Zgip fo 
(281genes) 
as wildtype < Strag?— 


Stra (152 genes) 


‘Regulation of transcription 
vz Hesx1, Ncor2, Dnmt3l 


Stem cell population maintenance : 
: Poudft, Esrrb, Nanog, Prdm14, Tel1 
‘Metabolic process 

Alpi, Galk1, Gsta4, Ganab, Pfkl 


‘ Glycolytic process 
' Gpd1I, Ldha, Hexb 5 


: Regulation of transcription ; 
‘_ Tfap2c, Tett, Notch1, Ror2 


1 
1 
coe daeiea ee eine weer eee s 


analysis of the STRA8 expression at c7 in the wild-type and Zg/p1’~ mPGCLC- 
derived cells cultured with BMP2 and RA from c3 onward. Right: STRA8 levels 
(normalized by B-ACTIN) from two biological replicates. The mean value for the 
wild-type mPGCLC-derived cells is set as 1.0. Error bars indicate SDs. (F) Ex- 
pression of key oogenic and meiotic genes at c7 in the wild-type and Zgip1-’- 
mPGCLC-derived cells in the cultures as indicated. The expression in SC* cells 
sorted by FACS was analyzed by quantitative polymerase chain reaction. The AC; 
from the average C; values of two housekeeping genes, Arbp and Ppia, is shown 
for each gene. The mean values from two independent experiments, each with two 
technical replicates, are shown. Error bars indicate SDs. N.D., not detected. The 
color code is as indicated. (G and H) Overlap of differentially expressed genes 
(DEGs) [loga(RPM + 1) > 4, logo(fold change) > 2 at c4, c5, c7, or c9] between the 
wild-type and ZgipI~’~ PGCLC-derived cells (common DEGs in clone E5 and 

G10; 530 up-regulated and 281 down-regulated) and DEGs between the wild-type 
and Stra8-’- mPGCLC-derived cells (common DEGs in clone B6 and F6; 224 
up-regulated and 152 down-regulated). Key GO enrichments are shown. 
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ZGLP1 from c3 onward without or with RA 
(fig. S12, A to C, and tables S4 and S5), and 
analyzed them with reference to the chroma- 
tin modification states of cultured mPGCLCs 
at c7 (8, 27) (Fig. S12, D to F, and table S6). 
Under the condition without or with RA, we 
defined 16,576 and 14,662 ZGLP1-binding peaks 
in common in c7 and c8 cells, respectively, 
and assigned 16,290 and 14,409 peaks to the 
transcription start sites (TSSs), respectively 
(fig. S12G). Under both conditions, the ma- 
jority of the peaks (>90%) were located with- 
in 15 kb of the TSSs, and approximately half 
of them were associated with promoters, bind- 
ing to a majority of the promoters or genes 
expressed in c7 or c8 cells [~74% and ~70% of 
genes expressed at logo(RPM + 1) > 4, respec- 
tively] (fig. S12, G to I, and table S5). Accord- 
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Fig. 5. Activation of Polycomb-repressed genes by ZGLP1. (A) Percentages 
of up-regulated genes [log2(fold change) > 2 from c3, at c4, c5, c7, and c9] in 
mPGCLC-derived cells cultured with Dox from c3 onward classified on the 
basis of ZGLP1 binding and the enrichment of two histone modifications 
(H3K4me3 and H3K27me3) (see table S6). The numbers of genes for each 
category are shown in parentheses. The color coding is as indicated. 

(B) Scatter plot showing the immunoprecipitation (IP) levels for H3K4me3 
and H3K27me3 around the TSSs. The dotted lines represent the threshold IP 
levels for each histone modification (see the materials and methods). Genes 
bound by ZGLP1 and up-regulated from c3 to c4 or from c3 to c5 by Dox 
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ingly, under the two conditions, a majority 
(279%) of the ZGLP1-bound genes overlapped 
(fig. S12J and table S5), suggesting that ZGLP1 
functions at gene proximities independently 
from RA. 

To identify ZGLP1-DNA interactions critical 
to the oogenic fate, we analyzed ZGLP1 bind- 
ing to genes that were up-regulated in response 
to ZGLP1 overexpression. Upon ZGLP1 over- 
expression at c3 without RA, 41, 181, 671, and 
829 genes were up-regulated from c3 at c4, c5, 
c7, and c9, respectively (Fig. 5, A and B, and 
fig. S121). ZGLP1 bound to ~51% (21/41), ~54% 
(98/181), ~64% (426/671), and ~63% (520/829) 
of the c4, c5, c7, and c9 genes, respectively (Fig. 
5, A and B, fig. S12I, and table $5). Notably, a 
dominant fraction of the c4 genes bound by 
ZGLP1 originally bore a repressive state; ~52% 


ZGLP1-bound genes 


(11/21) and ~14% (3/21) showed bivalent mod- 
ifications and H3K27me3 enrichment, re- 
spectively, and only ~29% (6/21) exhibited 
H3K4me3 enrichment (Fig. 5, A and B, and 
table S6). This trend was also the case for the 
c5 genes bound by ZGLP1, with ~37% (36/98) 
and ~18% (18/98) being bivalent and H3K27me3 
enriched, respectively (Fig. 5, A and B, and table 
S6). By contrast, similar to the overall ZGLP1- 
binding profile in c7 and c8 cells, most (>67%) of 
the c7 and c9 genes bound by ZGLP1 were active 
genes with H3K4me3 enrichment (Fig. 5, A and 
B, and table S6). Similarly, a dominant fraction 
of the c4 and c5 genes up-regulated by ZGLP1 
and RA and bound by ZGLP1 originally bore a 
repressive state (fig. S12, K and L, and table S6). 

The H3K27me3-enriched genes bound by 
ZGLP1 exhibited higher up-regulation at c4 


up-regulated upon + Dox at c4, c5 
B Cc 
® K27 enriched ®@Bivalent 
(18 genes) (37 genes) Genes up-regulated upon + Dox 
@ Neither @K4 enriched , : 7 . 
(1 gene) (45 genes) Bivalent K27 enriched K4enriched Neither 
Tara Nrsct ZGLP1-bound = -@- eo eo eo 
= 4 <Jaf9b | -Zbtb18 unbound 
Ss Vo sige ts! ‘ Prdmg 
a Meioc-..: \ @ 1.0 
c= i 
© = -Ccndt § 
=n Bnet g 
o< Zhx1 Q2 
Os 2 
Ze 7 S05 
Q ° i Incat o 8 
ef Zps9a TS 
So 4 io Yihde2 x 
S Fbxo47) cne2 o 
= 6 T T ++ T 
6 -4 2 0 2 4 
H3K4me3 [loga(IP/Input)] 
(Ohta et al. 2017) 
E ig 12 kb [ 5.5 kb oy 5 kb ~ 
arr [0-2] : [0-3] 
Tee ety tha de oo vy S — ote : ‘ ee H 
‘ H3K4me3 1 10  [0- : 
= 110-15] » [Os] + [0-15] ‘ 
a: H ' ' 
ns : : : 
Phe ees ' eee : 
eo ' [0-2] 1 [0-2] ‘ 
a errs eee wae ae Ye 
1 < 7 < < > : BR—f—+ 8 
Zhx1 Dmrtc2 Meioc ' 


Meioc, Sycp1, Tdrd1 


treatment (101 genes in t 


ZGLP1-bound and ZGLP1- 
[logo(RPM + 1)] for each 


enriched genes bound by 


6 March 2020 


levels between c3 and c9, 
**P < 0.005. (D) GO term enrichment among bivalent and H3K27me3- 


otal) are colored as indicated. Representative 


ZGLP1 up-regulated genes are annotated. (C) Expression dynamics of 


unbound genes. The expression levels 
gene are normalized by its peak expression 
and then the median values are plotted. *P < 0.05. 


ZGLP1 and up-regulated from c3 to c4 or from c3 to 


c5. (E) ChIP-seq tracks for ZGLP1, H3K4me3, and H3K27me3 around Zhx1, 
Dmrtc2, and Meioc. The ChIP-seq data for the two histone modifications are 
from sexually uncommitted mPGCLCs during culture (8). 
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(RA) 
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RNA processing 
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Meiotic prophase 


etrotransposon 


Repression of 
PGC program 


i Folliculogenesis 
repression 


Fig. 6. Model for oogenic fate determination in mice. BMP2, most likely secreted from embryonic 
granulosa cells (5, 6), up-regulates ZGLP1 in PGCs, which activates key oogenic programs, including 


mou 


those for “chromatin modification; 


RNA processing,” “retrotransposon repression, 


mu 


folliculogenesis,” and 


“meiotic cell cycle,’ whereas RA signaling contributes to the overall maturation of the oogenic program as 
well as to “repression of the PGC program.” STRA8 mainly regulates the meiotic program in a ZGLP1- 


dependent context. 


and at c5 than those not bound by ZGLP1, 
whereas the H3K4me3-enriched genes were 
up-regulated similarly irrespective of the ZGLP1 
binding (Fig. 5C). The bivalent and H3K27me3- 
enriched genes bound by ZGLP1 and up- 
regulated by c5 were enriched with those for 
“multicellular organism development,” “trans- 
cription,” “meiotic cell cycle,” and “sex differ- 
entiation,” and included Ddx25, Zbtb18, Zhx1, 
Meioc, Sycp1, Tdrd1, Dmrtc2, Dmrtb1, Taf9b, 
and Hdacil (Fig. 5, D and E). The trend for 
the preferential up-regulation of the repressed 
genes bound by ZGLP1 at c4 and c5 was also 
observed and even appeared to be augmented 
in mPGCLCs with ZGLP1 and RA (fig. S12, 
M and N), with bivalent genes also show- 
ing higher up-regulation at c4. Collectively, 
these findings support the idea that ZGLP1 
activates key genes for the oogenic program, 
including meiotic prophase, which are in 
poised and/or repressed states with bivalent 
or H3K27me3 modifications in sexually un- 
committed germ cells. 


Discussion 


On the basis of both in vivo and in vitro 
studies in mice, we have demonstrated that 
BMP signaling and its downstream effector, 
ZGLP1, specify the oogenic program through 
the regulation of genes involved in “RNA pro- 
cessing,” “transcription and chromatin mod- 
ification,” “retrotransposon regulation,” “meiotic 
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cell cycle,” and “oocyte development,” where- 
as RA signaling contributes to the overall 
maturation of the oogenic program as well as 
to the repression of the PGC program (Figs. 3 
and 6 and fig. S9). The finding that RA plays a 
role in repression of the PGC program, which 
involves a network of TFs for pluripotency, is 
consistent with a well-known function of RA 
in stem cell differentiation (28). Thus, our 
findings provide an integrated paradigm for 
the signaling pathway and transcriptional 
mechanism for the oogenic program in mice 
(Fig. 6). Transcriptome analysis of human fetal 
germ cells and gonadal somatic cells revealed 
the potential involvement of the BMP-ZGLP1 
pathway in human oogenic fate determination 
(29), suggesting that the BMP-ZGLP1 pathway 
may play a conserved function in oogenic fate 
determination in a wide range of species. Clar- 
ification of the biochemical mechanism of action 
of ZGLP!1 will be a critical future challenge. 
We have shown that Zglp/ plays a critical 
function in allowing the spermatogonia, not to 
enter, but to efficiently accomplish the meiotic 
prophase, and only a small fraction of the 
Zglpr!/ ~ cells generate round or elongated 
spermatids (14, 15) (fig. S5). The distinct phe- 
notypes between females and males repre- 
sent a sexual dichotomy of the mechanism for 
the onset and progression of the oogenic and 
spermatogenic processes, including meiosis, 
which also warrants further investigation. 
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Notably, the Stras-/~ male germ cells show 
more severe phenotypes: depending on the 
genetic background, they either fail to enter 
the meiotic prophase itself or fail to complete 
the meiotic prophase, with no formation of 
secondary spermatocytes or round or elongated 
spermatids (30, 31). Thus, Stra8 exhibits indis- 
pensable functions in ensuring the meiotic 
prophase both in females and males. A further 
understanding of the mechanism for the sex 
determination of germ cells will serve as a key 
not only for clarifying the etiology of relevant 
disease states, including infertility, but also for 
promoting in vitro gametogenesis in diverse 
mammalian species. 
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INTRODUCTION: The brain is the most com- 
plex organ of the mammalian body, boasting 
a diverse physiology combined with intricate 
cellular organization. In an effort to expand our 
basic understanding of the neurobiology of the 
brain and its diseases, we performed a com- 
prehensive molecular dissection of the main 
regions of the human, pig, and mouse brain 
using transcriptomics and antibody-based map- 
ping. With this approach, we have identified 
regional expression profiles and observed sim- 
ilarities and differences in expression levels be- 
tween these three mammalian species. 


RATIONALE: There is a need for a comprehen- 
sive overview of genes expressed in the mam- 
malian brain categorized by organ, brain region, 
and species specificity. To address this need, a 
brain-centered knowledge resource of RNA and 
protein expression in the brain of three mam- 
malian species has been created and used for 
cell topological analysis, systems modeling, and 
data integration. The regional expression of all 
protein-coding genes is reported, and this clas- 


Genome-wide transcrip- 
tomics analysis of anatomi- 
Cally dissected regions in 
mammalian brains un- 
covers regional and 
species-specific expres- 
sion. Multiple regions of the 
human, pig, and mouse brain 
were dissected and analyzed. 
A uniform manifold approxi- 
mation and projection 
(UMAP) analysis (middle) 
shows the global expression 
patterns of 1710 samples in 
the human brain, with the 
cerebellum as the outlier. 
The HPA Brain Atlas (right) 
shows the expression of indi- 
vidual genes, for example, 
synaptosomal-associated 
protein 25 (SNAP25), in the 
different brain regions in the 
three mammalian species. 


Body and brain 


@ Olfactory bulb 
@Hypothalamus 
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sification is integrated with results from the 
analysis of tissues and organs of the whole 
human body. All generated data, including high- 
resolution images and metadata, have been 
made publicly available in an open-access Hu- 
man Protein Atlas (HPA) Brain Atlas. 


RESULTS: The global analysis suggests similar 
regional organization and expression patterns 
in the three mammalian species, consistent 
with the view that basic brain architecture is 
preserved during mammalian evolution. How- 
ever, there is considerable variability between 
species for many neurotransmitter receptors, 
in particular between human and mouse. This 
calls for caution when using the mouse as a 
model system for the human brain, for exam- 
ple, in attempts to develop therapeutic strat- 
egies. For some of the brain regions, such as 
the cerebellum and hypothalamus, the human 
global expression profile is closer to that of the 
pig than it is to that of the mouse, suggesting 
that the pig might be considered a preferred 
animal model to study many brain processes. 


We show that many “signature genes” iden- 
tified previously for specific brain cell types 
(such as astrocytes, microglia, oligodendrocytes, 
and neurons) are expressed at even higher levels 
in peripheral organs. In fact, our results support 
a view of shared functions between many genes 
in microglia and immune cells, and a large 
number of genes previously identified as signa- 
ture genes for astrocytes are shown to be shared 

with liver or skeletal mus- 
cle. The cerebellum stands 
Read the full article Out as having a distinct 
at http://dx.doi. molecular signature with 
org/10.1126/ many regionally enriched 
science.aay5947 genes. Several genes sug- 
ie nein etal gested toche involved in 
neuropsychiatric diseases are selectively ex- 
pressed in the cerebellum. 


CONCLUSION: The integration of data from sev- 
eral sources has allowed us to combine data 
from transcriptomics, single-cell genomics, in 
situ hybridization, and antibody-based protein 
profiling. This integrative approach for mapping 
the molecular profiles in the human, pig, and 
mouse brain has generated a detailed multi- 
level genome-wide view on the protein-coding 
genes of the mammalian brain, where we com- 
pared tissue specificity across the whole body, as 
classified in the HPA (www. proteinatlas.org). The 
open-access HPA Brain Atlas resource offers the 
opportunity to explore individual genes and 
classes of genes and their expression profiles in 
the various parts of the mammalian brain. 


The list of author affiliations is available in the full article online. 
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}Corresponding author. Email: mathias.uhlen@scilifelab.se 
(M.U.); jan.mulder@ki.se (J.M.) 

Cite this article as E. Sjostedt et al., Science 367, eaay5947 
(2020). DOI: 10.1126/science.aay5947 


Regions Species 
Expression level 
Cortical regions for SNAP25 
Human 
45 #, 
04 $ 
x Hypothalamus 
4 120 5 Pig 
= Jt K ; | 
-4 | Basal ganglia ] 
“8 120 Mouse 
Cerebellum 
- = HPA 80 
i re 40: 
i -12 4 — a e FANTOM 
; 1 1 1 0 
-3 0 3 6 
UMAP1 
@Cerebral cortex @Hippocampal formation @Amygdala @Basal ganglia 
@Thalamus @Midbrain @Pons & medulla Cerebellum 
lofl 


RESEARCH 


RESEARCH ARTICLE 


INTEGRATIVE OMICS 


An atlas of the protein-coding genes in the human, 


pig, and mouse brain 


Evelina Sjéstedt!?, Wen Zhong’, Linn Fagerberg’, Max Karlsson’, Nicholas Mitsios?, Csaba Adori?, 
Per Oksvold’, Fredrik Edfors”, Agnieszka Limiszewska’, Feria Hikmet®, Jinrong Huang*>©”, 

Yutao Du®®, Lin Lin**®, Zhanying Dong*>, Ling Yang*>*, Xin Liu®®, Hui Jiang®, Xun Xu>*, 

Jian Wang®'®, Huanming Yang®, Lars Bolund*>, Adil Mardinoglu’, Cheng Zhang’, 

Kalle von Feilitzen’, Cecilia Lindskog®, Fredrik Pontén®, Yonglun Luo*®*, Tomas Hokfelt’, 


Mathias Uhlén'?*+, Jan Mulder**+ 


The brain, with its diverse physiology and intricate cellular organization, is the most complex organ of 
the mammalian body. To expand our basic understanding of the neurobiology of the brain and its 
diseases, we performed a comprehensive molecular dissection of 10 major brain regions and multiple 
subregions using a variety of transcriptomics methods and antibody-based mapping. This analysis was 
carried out in the human, pig, and mouse brain to allow the identification of regional expression profiles, 
as well as to study similarities and differences in expression levels between the three species. The 
resulting data have been made available in an open-access Brain Atlas resource, part of the Human 
Protein Atlas, to allow exploration and comparison of the expression of individual protein-coding genes in 


various parts of the mammalian brain. 


he brain is an extraordinarily complex 

organ owing to its diverse physiology, 

complex cellular organization, and abun- 

dance of expressed genes. Identifying the 

molecular organization of the brain at 
regional, cellular, and subcellular levels will 
advance our understanding of its function 
under normal and diseased conditions. The 
Human Protein Atlas (HPA) program aims to 
combine antibody-based profiling with genome- 
wide transcriptomics analysis to explore the 
spatial expression levels of transcripts and 
proteins across cells, tissues, and organs (1). 
The Tissue Atlas (J, 2)—a subsection of the 
HPA—includes only a limited number of hu- 
man brain regions (the cerebral cortex, hip- 
pocampus, caudate nucleus, and cerebellum). 
Here, we describe genome-wide expression 
profiles for the protein-coding genes in 10 
major well-defined mammalian brain regions 
to capture the complexity of the cellular or- 
ganization. To identify differences and sim- 
ilarities of the brain in different phylogenetic 
orders, the expression profiles have been an- 
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alyzed in three species: primates (human), 
Cetartiodactyla (pig), and Rodentia (mouse). 

The effort described here is complementary 
to several brain mapping projects focused on 
basic organization and regional or cellular gene 
expression of the mammalian brain. The Allen 
Institute for Brain Science (https://alleninstitute. 
org) hosts several knowledge resources, in- 
cluding an in situ hybridization atlas of the 
adult (3) and developing (4) mouse brain; and 
a microarray-based atlas of the adult human 
brain (5) has been complemented with a map 
of the human brain during development (6). 
More recently, brain atlas strategies have been 
launched on the basis of different approaches: 
fluorescence-activated cell sorting in mouse 
(7), antibody-based cell sorting in human (8), 
single-cell gene expression in mouse (9) and 
human (JO, 11), and covariation analysis of 
transcriptomics expression (12). These efforts 
have been further complemented with several 
large-scale mapping programs, including the 
National Institutes of Health (NIH) BRAIN 
Initiative Cell Census Network (13), the Euro- 
pean Human Brain Project (/4), the NIH Hu- 
man BioMolecular Atlas Program (15), and the 
Human Cell Atlas project (6). 

Here, we present the HPA Brain Atlas (17), 
where the data collected have been used for 
cell topological analysis, systems modeling, 
and data integration, with the aim to create a 
knowledge resource of messenger RNA and pro- 
tein expression in the mammalian brain. We 
complement the transcriptomics with antibody- 
based protein profiles of selected proteins in 
multiple regions of the mouse brain. In this 
open-access resource, transcriptomics data from 
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three external sources—the Genotype-Tissue 
Expression (GTEx) portal (78), the Function- 
al Annotation of Mammalian Genomes 5 
(FANTOMS) project (19), and the Allen Mouse 
Brain Atlas (3)—are presented together with 
RNA profiles and protein stainings generated 
“in-house.” The classification of all protein- 
coding genes with regard to brain regional 
specificity is reported, and this is integrated 
with the tissue and organ specificity across the 
human body. 


Transcriptomics analysis of the human brain 


Transcriptomics analysis was performed on 
anatomically dissected human, pig, and mouse 
brain regions (Fig. 1A and figs. S1 to $3). For 
the human brain, we integrated publicly avail- 
able RNA sequencing (RNA-seq) data gener- 
ated by the GTEx consortium (78) and cap 
analysis of gene expression (CAGE) data from 
the FANTOM consortium (19), with data from 
the HPA (J), for a total of 1710 samples from 
selected human brain regions (table S1). The 
combined dataset contains 23 human brain 
regions, including white matter (corpus callo- 
sum) and spinal cord, as outlined in Fig. 1B. 
Several issues complicate the combining of 
datasets. First, samples may not be homoge- 
neous, especially for regions with a high level 
of cellular heterogeneity, such as the hypo- 
thalamus, midbrain, pons, and medulla oblong- 
ata. Furthermore, both HPA and GTEx data are 
based on RNA-seq protocols using polyadenyl- 
ate [poly(A)] tail enrichment, whereas CAGE 
data are based on the selection and sequenc- 
ing of the 5’ cap. As a result, genes lacking the 
poly(A) tail, such as canonical histone mRNA, 
are only detected by CAGE. Despite these com- 
plications, the large number of included sam- 
ples and our gene classification approach enable 
us to generate a comprehensive overview of bio- 
logically relevant gene expression and regional 
and species variation. 

We used normalization strategies to avoid 
batch effects caused by sampling, technology 
platforms, and differences in transcriptome 
size between different types of tissues and 
also to allow both within-sample and between- 
sample comparisons (20, 21). The within-sample 
normalization was based on protein-coding 
transcripts per million (pTPM), while the 
between-sample normalization was based on 
trimmed means of M values (TMM) (22), Pareto 
scaling per gene (23), and limma (24), resulting 
in normalized expression (NX) values calculated 
for all genes across all tissue types, as out- 
lined in Fig. 1C and described in detail in the 
supplementary information (figs. S4 to S6). 

The uniform manifold approximation and 
projection (UMAP) (fig. $7) of all 1710 human 
brain samples shows the expected global ex- 
pression patterns after normalization. Devel- 
opmentally related anatomical regions cluster 
together, with the cerebellum being an outlier 
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Fig. 1. Genome-wide transcrip- 
tomics analysis of anatomically 
dissected regions in mamma- 
lian brains. (A) Multiple regions 
of the human, pig, and mouse 
brain were dissected and analyzed 
using transcriptomics methods. 
(B) A summary of the included 
brain subregions, with 23 human, 
30 pig, and 17 mouse samples, in 
10 main brain regions (for an 
anatomical overview, see figs. S1 
to S3). The subregions are as 
follows: olfactory bulb, ob; 
prefrontal cortex, pf; frontal lobe, 
fr; motor cortex, mo; cingulate 
cortex, cg; retrosplenial cortex, 
rt; somatosensory cortex, ss; 
paracentral gyrus, pa; postcentral 
gyrus, pc; temporal lobe, 

tp; insula cortex, in; occipital lobe, 
oc; entorhinal cortex, en; subicu- 
lum, sb; amygdala, am; hippo- 
campus, hc (ventral, hv, and 
dorsal, hd); nucleus accumbens, 
na; ventral pallidum, vp; globus B 
pallidus, gp; putamen, pu; caudate 
nucleus, cn; caudate putamen, 

cpu; septum, sp; hypothalamus, 

hy; thalamus, th; substantia nigra, 

sn; midbrain, mb; superior 

colliculus, sc; periaqueductal gray, 

pg; pons, po; locus coeruleus, 

Ic; medulla oblongata, 


fos 
my; cerebellum, cb; corpus 
callosum, cc; spinal cord, 


spc (dorsal, sd, and ventral, sv). (C) Overview of the d 


Structure 


RNAseq 


f 


shown, as well as genes overlapping and nonoverlapping between datasets (see fig. S5 for extended version). 


compared with other brain regions. No bias 
between the different platforms for transcrip- 
tomics analysis (HPA, GTEx, and FANTOM) 
was observed. The expression data for all an- 
alyzed human brain regions covering 19,670 
human protein-coding genes are presented 
in a gene-specific manner in the HPA Brain 
Atlas (see below). The regional expression data 
in the human brain include 15,157 protein- 
coding genes detected in at least one region of 
the brain, ranging from 13,068 to 14,332 ex- 
pressed genes per brain region (fig. S8). 


Transcriptomics analysis of the pig brain 


Brain transcriptome analysis of two male and 
two female adult pigs (Bama minipig, aged 
1 year) was performed for anatomically dis- 
sected brain regions covering the whole brain, 
as outlined in table S2. The pig brain was di- 
vided into 30 anatomically defined brain regions 
(Fig. 1B). A normalization protocol using TMM 
and Pareto scaling was used, as outlined in 
fig. S4. A UMAP analysis of the transcript ex- 
pression profiles of all samples (fig. S9) indi- 
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ata normalization approach, combining five separate datasets. Total gene numbers for respective datasets are 


cates the overall similarity between subregions 
and illustrates the expression variation be- 
tween the cerebrum regions and regions of the 
brainstem. On the basis of the pig gene build 
Ensembl 92 and a detection cutoff at NX = 1,a 
total of 18,686 genes were detected in the pig 
brain, with 15,601 to 17,394 genes detected in 
individual brain regions (fig. S10). Expression 
data for 14,656 protein-coding genes with a one- 
to-one pig ortholog can be found in the gene- 
specific pages of the HPA Brain Atlas (17). 


Transcriptomics analysis of the mouse brain 


A genome-wide transcriptomics analysis was 
performed on multiple regions of two male 
and two female adult mice (C57bl/6n, aged 
2 months). The mouse brain was divided into 
17 anatomically defined brain regions (table 
$3). A normalization protocol using TMM and 
Pareto scaling was used, as outlined in fig. S4. 
The UMAP plot of the global expression pat- 
terns shows the expected pattern with devel- 
opmentally related anatomical regions clustering 
together (fig. S11). On the basis of a cutoff for 


2020 


detection at NX = 1, a total of 15,823 brain-ex- 
pressed mouse genes were detected, with 12,977 
to 14,402 genes per brain region (fig. $12). Data 
for 15,160 protein-coding genes with a mouse 
one-to-one ortholog are presented in the gene- 
specific pages of the HPA Brain Atlas (17). 


Genome-wide classification of all 
protein-coding genes based on regional 
brain expression 


Expression data for the various brain regions 
of the three species were summarized into 10 
main regions (Fig. 2, A to C). On the basis of 
the maximum expression in any of the analyzed 
subregions, a consensus result of the 10 regions 
for three species was generated. These re- 
gions are the olfactory bulb, all cerebral cor- 
tex regions, subfields of the hippocampus, the 
amygdala, regions of the basal ganglia, the hy- 
pothalamus, the thalamus, subfields of the 
midbrain, the pons and medulla oblongata, 
and the cerebellum (Fig. 1B). A hierarchical 
clustering of the 10 main regions was per- 
formed using the global expression profiles 


2 of 16 


RESEARCH | RESEARCH ARTICLE 


cortical regions 
54 
eR al 
eS: Rie Sica S 
‘i a ry ganglia 
hippocampus =e 
0-4 
nN brainstem 
Pe hypothalamus 
= 
> 
54 
cerebellum 
-104 "HPA 
4 GTEXx #h Ca 4 
+ FANTOM “i 
“6 “3 0 3 6 
UMAP1 
oo aa say : 
o 


Qn 
So 


cortical regions 
s * bas bad 
ee ; oe Pe 
cortical regions ° Cae ae 
254 . e ee 2 e aaa 
ee e e oo . a) e 
* e ‘ basal ganglia hippocampus 
2 ee ° 
hd . ° ° 
a . & 0 : * basal ganglia 
S 0.04 ee hi S i 
5 . hippocampus s 
cerebellum hypothalamus 
of : ° 
olfactory bulb a olfactory bulb 
hypothalamus -2 
-2.5 eae 
° » brainstem 
*s brainstem 
Ta . eR -4) | cerebellum >. 
=1 0 2 2 0 2 
UMAP1 UMAP1 
o oO 
wo Q 
= a 
o oO 


(6b) olfactory bulb BB cerebral cortex {fp} hippocampal formation 


im) amygdala (69) basal ganglia @ rypothatamus (ff) thatamus (89 midbrain {Bim pons & medulla (cb) cerebellum 


Fig. 2. Regional comparison based on global expression in three mammalian species. (A) Uniform manifold approximation and projection (UMAP) analysis 
showing the global expression patterns of all samples in 10 human brain regions (1616 total) from HPA, GTEx, and FANTOM. (B) UMAP plot of pig brain samples in 
10 regions (107) used for mapping regional transcript expression in the pig brain. (©) UMAP plot of mouse brain samples in 10 regions (64) analyzed in this 
study from the mouse brain. (D) Hierarchical clustering based on pair-wise Spearman correlation of the transcript expression levels in 10 main brain regions is shown. 
(E and F) Same as (D), but for pig and mouse brain regions, respectively. 


of all the protein-coding genes. The results for 
each of the three mammalian brains are shown 
in Fig. 2, D to F, with details in fig. S13. The 
hierarchical trees show a similar structure, 
with the cerebellum as an outlier in all three 
species and with the three cerebrum regions 
(cerebral cortex, hippocampus, and amygdala) 
close together, next to the basal ganglia. Sim- 
ilarly, the three brainstem regions (midbrain, 
thalamus, and pons and medulla) cluster to- 
gether in all three species, next to the hypo- 
thalamus. The analysis confirms that the global 
expression patterns in the different regions 
of the three mammalian brains are preserved 
during mammalian evolution. 

To identify regionally specific molecular fea- 
tures, regionally elevated genes were classified 
according to their expression across the 10 main 
brain regions. Elevated genes were further strat- 
ified into regionally enriched (fourfold higher 
expression compared with any other brain re- 
gion), group enriched (several brain regions 
with fourfold higher values than all other re- 
gions), and regionally enhanced (fourfold higher 
expression than the average expression of the 
10 regions). Genes not elevated in a single re- 
gion or group of brain regions are classified 
as genes with low regional specificity (the clas- 
sification is described in detail in table S4). 
This classification was performed across all 
protein-coding genes on the basis of NX val- 
ues. The numbers of regionally enriched, group 
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enriched, and regionally enhanced genes are 
shown in fig. $14 and in the HPA Brain Atlas 
resource (see below). Heatmaps show the dis- 
tribution of regionally enriched, group-enriched, 
and regionally enhanced genes in the 10 regions 
of the human (fig. S15), pig (fig. S16), and mouse 
(fig. S17) brain. In all three species, the cere- 
bellum contains the largest number of re- 
gionally enriched genes, while most group 
enriched genes are shared among the regions 
of the cerebrum and brainstem, respectively 
(fig. S18). 


Comparative analysis of transcriptomics, 
in situ RNA hybridization, and 
immunofluorescence protein staining 


A comprehensive and extensively used mouse 
brain gene expression atlas has been gener- 
ated by the Allen Institute using probe-based 
in situ transcriptomics (3). In the HPA Brain 
Atlas, we have integrated expression profiles 
from the Allen Brain Atlas for all mouse genes 
(with a human one-to-one ortholog) with the 
HPA-generated RNA-seq and antibody-based 
protein distribution data. The two transcrip- 
tomics sets are highly complementary, because 
RNA-seq expression data provide sensitive 
quantitative transcript information, although 
these data have the disadvantage that mixtures 
of cell types are analyzed. The in situ hybridiza- 
tion data provide spatial expression data on a 
single-cell level, but this probe-based method 
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is less quantitative than the count-based RNA- 
seq method. In addition, for selected proteins, 
an immunofluorescence protein distribution 
map was generated, allowing visualization of 
protein distribution on a cellular level, includ- 
ing neuronal processes, with high spatial res- 
olution. An advantage of this protein staining 
is that anatomically stacked images can be 
generated, and this has allowed us to anno- 
tate more than 120 regions and subfields of 
the brain. Together, the three complementary 
datasets provide genome-wide regional pro- 
files of the protein-coding genes and their ex- 
pression in the different regions of the brain. 

The results are displayed in the HPA Brain 
Atlas, and this resource allows for compar- 
isons of the HPA data (RNA-seq), the probe- 
based in situ hybridization (ISH), and the 
antibody-based protein immunofluorescence 
(IF) staining for all 10 regions of the mouse 
brain, as exemplified for five genes in Fig. 3A. 
Insulin-like growth factor binding protein 5 
(IGFBP5) is shown to be expressed in all ana- 
lyzed regions of the mouse brain according 
to all datasets. However, both the mRNA loca- 
tion (ISH) and immunoreactivity (IF) reveal 
a distinct expression pattern in the mouse 
olfactory bulb with expression in mitral cells, 
localized both in soma and proximal dendrites. 
For NECABI1, an N-terminal EF-hand calcium 
binding protein with unknown function, brain- 
wide expression is also observed. The ISH and 
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Fig. 3. Comparison of mouse regional data from transcriptomics, in situ 
hybridization, and immunofluorescence. (A) Examples of expression 
profiles from the HPA Brain Atlas, with mouse expression shown as data from 
RNA-seq (this study), in situ hybridization (ISH) data from the Allen Brain 
Atlas, and regional staining intensity based on antibody-based immuno- 
fluorescence (IF) profiling (this study). The color codes are the same as in 
Fig. 2. Below are examples of ISH and IF staining for each gene, from left to 
right: insulin-like growth factor binding protein 5 (IGFBP5); N-terminal 
EF-hand calcium binding protein 1 (NECAB1); norepinephrine transporter, also 


called solute carrier family 6 member 2 (NET1 or SLC6A2); adenylate cyclase— 
activating polypeptide 1 (ADCYAP1); and G protein-coupled receptor 151 
(GPR151). Glomerular layer, gl; external plexiform layer, epi; mitral cell layer, ml; 
granule cell layer, gra; locus coeruleus, Ic; central amygdala, ca; cortical 
amygdala, coa; lateral habenula, Ihb. (B) iDISCO+ volume immunostaining of a 
whole mouse brain for GPR151 receptor. The medial habenula (mhb), fasciculus 
retroflexus (fr), interpeduncular nucleus (ip), and parabrachial-pericoerulear 
region (pb-pc) are strongly stained. The boxed region in the top image is 
enlarged in the bottom image. Scale bars, 250 um. 


the IF data indicate a distinct neuronal ex- 
pression of NECAB1 in subsets of neurons in 
various regions of the thalamus and forebrain, 
including pyramidal neurons in the cerebral 
cortex. The (nor)epinephrine uptake trans- 
porter (SLC6A2), also called NET], is an ex- 
ample of an apparent partial discrepancy 
between the RNA and the protein location: 
The RNA transcript is detected in cell bodies 
of the locus coeruleus in the pons with an 
expression pattern resembling that of the 
well-characterized (nor)epinephrinergic neu- 
rons. However, this transporter protein can- 
not be detected in the cell bodies with IF. This 
is because NET1 is rapidly transported into the 
extensive axonal network. Adenylate cyclase- 
activating polypeptide 1 (ADCYAP1) is known 
to stimulate the generation of cyclic adenosine 
monophosphate (cAMP), and all three data- 
sets show widespread expression across the 
brain regions, with the highest levels in the 
hypothalamus and amygdala. In the latter 
region, ISH shows that this gene is expressed 
by cells located in the cortical amygdaloid 
nucleus, whereas protein labeling is found 
in nerve terminals in the central amygdaloid 
nucleus, which is known to receive an input 
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from the cortical amygdala. The results sug- 
gest that this protein is primarily presynaptic 
and support a role in cAMP-mediated synap- 
tic plasticity. Expression of the orphan G 
protein-coupled receptor 151 (GPR151) can be 
visualized both with ISH and IF in neuronal 
cell bodies in the habenular nucleus of the 
mouse thalamus. However, the protein stain- 
ing also shows the projection from the thal- 
amus to the interpeduncular nucleus in the 
midbrain. This orphan receptor is thus visual- 
ized in the neuronal soma, in axons running in 
the fasciculus retroflexus, and in the presyn- 
aptic terminals. This expression pattern can 
also be shown using three-dimensional imag- 
ing of solvent-cleared brain (iDISCO) en- 
compassing a whole mouse hemisphere (Fig. 
3B). In addition, this analysis revealed that 
a portion of the axons pass the interpedun- 
cular nucleus and innervate the parabrachial- 
pericoerulear region. Movie S1 shows the 
three-dimensional location of this orphan 
receptor. Together, these examples illustrate 
how combining three different approaches for 
spatial transcriptomics and proteomics results 
in insights offering detailed information on 
cellular expression and protein location. 
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Species comparisons of regional brain 
expression 

To compare the expression profiles in the 
three mammalian brains, all genes with one- 
to-one orthologs in human, mouse, and pig 
were identified, and a total of 12,999 protein- 
coding genes were analyzed (fig. SI9A). Addi- 
tional genes can be included in the analysis 
in the future, as additional one-to-one gene 
orthologs are identified. A combined hierar- 
chical tree including all regions of the three 
species based on all regionally elevated genes 
across the 10 main regions is shown in Fig. 4A. 
The results again support a preserved brain ar- 
chitecture, where the hypothalamus and cer- 
ebellum of all three species cluster in proximity 
to each other. Similarly, the brainstem regions 
and the cerebrum regions cluster together. 
Neighboring regions from one species cluster 
together, but clustering is less tight for cor- 
responding regions from different species. The 
olfactory bulbs from pig and mouse are clus- 
tered tightly together, and the outlier is the 
olfactory bulb from humans, which shows 
similarities with the cerebrum regions of hu- 
mans. This might be due to sampling error, as 


discussed above, but could also reflect the 
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Fig. 4. Species comparison of regional A 
expression in the mammalian brain. 

(A) The expression levels of 1422 genes 
classified as regionally elevated in either 
human, pig, or mouse were used for 
hierarchical clustering analysis, showing 

the relationship of the 10 main brain 

regions from the three species. 
(B) A heatmap showing the expression 
levels in the different brain structures 
in human (H), pig (P), and mouse (M) 
brain of enriched genes shared by 

the three species, based on brain 
structure comparison shown in Venn 
diagrams (fig. S19B). The same 
expression data, but visualized with 
the 10 regions of the brain, are shown in 
fig. S20. (€) Examples of regionally 
enriched genes in the mammalian brain 
and the protein location shown using 
immunofluorescence. GNAL is enriched 
in basal ganglia of human and pig brain, 
and this protein shows highest expression 
in basal ganglia (neuropil) in all three 
species. TMEM266 is cerebellum- 
enriched in all three species and located 
in synaptic glomeruli of the granular 
ayer. (D) SCGN is expressed in the 
olfactory bulb in all three species with 

a higher expression in granule cells (gl) in 
the mouse and pig. In cerebellum, 
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SCGN is only expressed in the molecular layer (ml) of the human and is not detected in pig or mouse cerebellum. In contrast, the transcription factor TFAP2B, 


coexpressed with SCGN in human, is expressed in the molecular layer of the cerebellum in all three species. 


more extended pig and rodent olfactory sys- 
tems, which have olfactory bulbs that are 
larger than the more rudimentary human 
olfactory bulbs. 

To analyze the differences in elevated genes 
in the three species, the regions were organized 
into four main brain structures (cerebrum, 
brainstem, cerebellum, and hypothalamus) 
(Fig. 4A), and the molecular features of each 
brain structure shared by all three species were 
identified. Genes with the highest expression 
in the same brain structure in all three spe- 
cies with enriched expression in at least one 
species were identified, and a list of 537 genes 
was obtained (fig. S19B). Heatmaps of the 
expression levels of these 537 brain structure- 
enriched genes are displayed in Fig. 4B and 
fig. S20, demonstrating the similarity of ex- 
pression pattern for these genes across the 
regions of the brain in the three species. Many 
known genes are found among these 537 genes 
with brain structure-enriched expression, in- 
cluding the neuropeptides galanin, oxytocin, 
and vasopressin (hypothalamus); transcription 
factors such as T-box brain protein 1 (TBR1), 
special AT-rich sequence-binding protein 2 
(SATB2), and neurogenic differentiation fac- 
tor 6 (NEUROD6) (cerebrum); and hox genes 
(brainstem), but less well characterized genes 
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are also found (table S5). The former include 
the G protein subunit alpha L (GNAL), which 
is highly expressed in the basal ganglia and 
known to couple to adenosine A2A and dopa- 
mine type 1 receptors (Fig. 4C) (25). The pro- 
tein staining suggests synaptic location in the 
caudate nucleus or caudate putamen of all 
three species. Similarly, cerebellum-enriched 
transmembrane protein 266 (TMEM266, also 
known as HVRP1) is selectively detected in the 
synaptic glomeruli in the granular layer in all 
three species (Fig. 4C). This detection is in line 
with the reported role of this postsynaptic pro- 
tein in the communication between mossy 
fibers and granule cells (26). 

The normalized data also allowed us to 
identify genes with differential brain expres- 
sion across the three species. Volcano plots 
show the overall fold difference in gene ex- 
pression based on the 10 regions (figs. S21 to 
$23 and table S6), and these data were com- 
bined in scatterplots (figs. S24 to S26) showing 
species-specific molecular features (one versus 
two species). Many proteins show similar ex- 
pression in the three species, as exemplified 
in Fig. 4D for transcription factor AP-2-beta 
(TFAP2B) expressed by y-aminobutyric acid- 
releasing (GABAergic) interneurons (27), in- 
cluding stellate cells, in all three species. 


6 March 2020 


External plexiform layer, pl. Scale bars, 50 um. 


However, many differentially expressed genes 
associated to specific brain functions could 
also be identified, such as the low expres- 
sion of the astrocytic genes glial fibrillary 
acidic protein (GFAP) and clusterin (CLU) in 
mouse compared with human and pig. For 
each of the 10 brain regions, a triangle plot 
indicates the relative expression of each gene 
in the three species (fig. S27). As an example, 
secretagogin (SCGN) is an EF-hand calcium 
binding protein expressed in the olfactory 
bulb (28) that is also seen in the stellate cells in 
the molecular layer of the human cerebellum. 
This contrasts with pig and mouse, where this 
protein cannot be detected in the cerebellum 
(Fig. 4D). 


The neurochemical architecture of the 
mammalian brain 


Brain functions are driven by complex circuits 
composed of different types of neurons with 
chemical phenotypes adapted to receive and 
generate signals. To identify species similar- 
ities and variations that characterize these 
types of neurons and their neurotransmitter 
systems, as well as other classes of cells, we 
analyzed the distribution of cell identity genes 
in all three mammalian species. These include 


(i) transcription factors (7 = 1053 genes), which 
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are essential for differentiation and mainte- 
nance of cell identities in the postmitotic brain 
(29); (ii) cell identity genes, including neuro- 
peptides, proteins, and enzymes responsible 
for the production, transport, and clearance of 
neurotransmitters (7 = 63); and (iii) all known 
neurotransmitter and neuropeptide receptors 
and receptor subunits (n = 118). Comparing 
the correlation values between species for 
these protein classes reveals a higher correla- 
tion for transcription factors (P < 0.001), cell 
identity genes (P < 0.001), and receptors (P < 
0.01) relative to the gene expression of all other 
11,765 genes with their one-to-one ortholog 
(Fig. 5A). 

We found that expression of some transcrip- 
tion factors is highly conserved across the 


Fig. 5. Expression profiles of cell identity genes in 
the mammalian brain. (A) Overall Pearson correlation 
between species for transcription factors (red), 

genes involved in the production and processing of 
neurotransmitters and neuropeptides (blue), and 
metabotropic and ionotropic neurotransmitter and 
ropeptide receptors (green) in contrast to all other 
nes (dark blue). (B) Examples showing the relative 
expression in the 10 regions of human, pig, and mouse 
ins display elevated expression in developmental 
anatomical defined regions of the brain. 

(C) Comparing the relative distribution of the mono- 
aminergic systems and (D) selected neuropeptide 
genes reveals a conserved pattern of expression, 
especially of the enzymes responsible for the produc- 
tion of dopamine (TH, DDC), noradrenaline (+DBh), 
adrenaline (+PNMT), and serotonin (TPH2) as well as 
the opioid peptide proenkephalin (PENK). Although 
many neurotransmitter receptors show a similar 
distribution profile [DDRs, ADRs, HTRIA, and 5- 
hydroxytryptamine receptor 6 (HTR6)], several excep- 
tions with clear on/off differences between species 
could be observed, especially in the cerebellum 
(HTR5A, OPRMI, OPRD1, CCKBR, and NPFFR1). 

(E) Relative expression of all GABA, receptor subunits 
in the cerebral cortex and cerebellum and (F) nicotinic 
receptor subunits in all three species suggests 
alternative nicotinic subunit composition in different 
species, especially the pig. (G) Four brain-elevated 
orphan GPCRs (GPR6, GPR52, GPR88, and GPR149) 
are elevated in the caudate and putamen, 

whereas GPR150 is group-enriched in the forebrain 
regions. The nonvisual photoreceptor melanopsin 
(OPN4) is only expressed in the human basal 


and 


ganglia. Missing values (gray bars) are due to missing 


one-to-one orthologs or genes not included in 

all used datasets. (H) The relative distribution 

of brain-elevated GPCRs with unknown function 
reveals widespread expression of the orphan 
GPR37L1 and GPR162, including the cerebellum, 
whereas GPRCSC expression is elevated in cerebellum. 
Scale bars, 40 um. Full overviews with NX for all 
transcription factors, neurotransmitters, and GPCRs 
are available in figs. S28 to S30. 
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three species, while other transcription factors 
have a less-maintained expression profile, thus 
affecting the overall correlation (fig. S28). Ex- 
amples of some of the transcription factors 
with conserved distribution across the brain 
regions are illustrated in Fig. 5B. Homeobox 
protein 1 (EMX1), known to be expressed by 
most of the neurons in the cerebral cortex 
and hippocampus (30), has a similar expres- 
sion pattern and expression levels in all three 
species. Class E basic helix-loop-helix protein 
22 (BHLHE22, or Bhlhb5), which regulates 
postmitotic differentiation of cortical neurons 
(3D), has a highly similar expression pattern in 
the cerebrum regions. In contrast, expression 
of the transcription factor SIX6 is restricted to 
the hypothalamus of all three species, which 
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is in line with earlier reports (32). Pituitary 
homeobox 3 (PITX3) is mainly expressed in 
the midbrain in all three species, with the 
highest expression in the substantia nigra in 
pig, also supporting earlier observations that 
PITX3 plays a role in the development of do- 
paminergic neurons of the substantia nigra in 
mice (33). Homeobox protein Hox-B5 (HOXB5) 
is specifically expressed in the pons-medulla 
oblongata region, supporting the observation 
that Hox genes are expressed in the hindbrain 
and known to be important for the segmental 
patterning of this part of the brain. Neurogenic 
differentiation factor 1 (NEUROD1) is essential 
for the development of the cerebellum in rats 
(34). We find that this transcription factor is 
restricted to the cerebellum in all three species 
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(Fig. 5B). Notably, NEUROD1 is also expressed 
in retina of mouse and pig but not human 
(see gene-specific page of HPA Brain Atlas). 
An example of a transcription factor with a 
differential regional expression pattern is 
eomesodermin (EOMES, or TBR2), which has 
a high expression level in the human cerebel- 
lum but is expressed mainly in the olfactory 
bulb in the mouse brain. In pig, however, this 
transcription factor is expressed at equal levels 
in the cerebellum and olfactory bulb. Overall, 
these results show an evolutionarily preserved 
distribution and regulatory role of some of the 
transcription factors, which likely provide a 
foundation for basic brain architecture during 
evolution. However, these data also reveal sub- 
stantial species variation in the distribution of 
transcription factors, including many unchar- 
acterized transcription factors not yet linked 
to a cell type. 


The neurotransmitters 


The expression patterns of the proteins involved 
in brain signaling were analyzed in the various 
regions of the human, pig, and mouse brain, 
revealing a strong correlation between all 
species for genes essential for the production 
and transport of neurotransmitters and pep- 
tides (Fig. 5A). This confirms the known sim- 
ilarities between species with regard to the 
distribution of cell types and functions in the 
brain. In Fig. 5C, the relative distribution of 
molecular components of the neuromodula- 
tory monoaminergic systems is shown, con- 
sisting of the (nor)adrenergic, dopaminergic, 
and serotonergic systems. In general, the en- 
zymes responsible for the production of these 
neurotransmitters are distributed similarly in 
the three species, indicating a conserved or- 
ganization in the mammalian brain, although 
there are some exceptions that might have 
neuropharmacological implications. For exam- 
ple, lysergic acid diethylamide (LSD) has a high 
affinity for several serotonin [5-hydroxytryptamine 
(5-HT)] receptors (35). Rodents have genes 
coding for serotonin receptors 5A and 5B 
(HTR5A and HTR5B) (36), whereas humans 
and pigs lost the gene for HTR5B during 
evolution. In mouse, HTR5A and HTRS5B are 
expressed throughout the brain, with lowest 
expression in the cerebellum. Human and pig 
show a similar distribution to each other, 
with the highest expression of HTR5A in the 
cerebellum (Fig. 5C). 

Another example of species differences is 
tyrosine hydroxylase (TH), the rate-limiting 
enzyme in the synthesis of the catecholamines 
dopamine, noradrenaline, and adrenaline. TH 
is expressed in the mouse olfactory bulb in a 
substantial number of mainly periglomerular, 
often GABAergic neurons, but this transcript 
is under the detection cutoff in human and 
pig. The human olfactory bulb does contain 
TH-positive neurons (37) and, as mentioned 
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previously, the human results on the bulb 
may be compromised by dissection problems. 
Notably, the transcription factor PITX3 (Fig. 
5B), known to bind with high affinity to the 
TH promotor (38), shows a similar species 
variation. The results also show a surprisingly 
strong and wide distribution of phenyletha- 
nolamine N-methyltransferase (PNMT) (Fig. 
5C), the enzyme converting noradrenaline to 
adrenaline. As expected, PNMT is present in 
the pons and medulla oblongata of all three 
species (39, 40), but unexpectedly high expres- 
sion is found in mouse and human basal 
ganglia, in mouse cortex, and in pig olfactory 
bulb. A transient and wide expression of PNMT 
transcript and protein has been observed in the 
rat brain, but only during the first postnatal 
month (39). A pronounced species difference 
is also seen for the expression of the sodium- 
dependent serotonin transporter (SLC6A4) ex- 
pressed in the brain regions containing sero- 
tonergic neurons in mouse and human, while 
the pig has more widespread expression (Fig. 
5C). It has earlier been reported (47) that the 
expression of this transporter in the human 
and mouse developing brain is widespread 
before its expression is restricted to sero- 
tonergic neurons in the adult brain. Thus, our 
data suggest that SLC6A4 retains its develop- 
mental distribution in adult pigs. 

The analysis of the genes coding for neuro- 
peptide systems also revealed similarities and 
differences in expression between the species; 
for example, in the opioid system there is a 
highly conserved expression of the ligand pro- 
enkephalin (PENK) combined with a dissimilar 
regional expression of the delta (OPRD1) and 
mu (OPRM1) opioid receptors, both G protein- 
coupled receptors (GPCRs) (Fig. 5D). Our data 
support earlier observations that OPRM1 is ex- 
pressed in the human cerebellum (42) but not 
in the mouse cerebellum (43), and here we 
show that this receptor cannot be detected in 
the cerebellum of pig. Instead, the expression 
profiles show the presence of OPRD1 in the pig 
cerebellum. The neuropeptide receptor NPFFR1 
is here shown to be predominantly expressed 
in the human cerebellum, although this re- 
ceptor binds the opioid-modulating peptide 
NPFF expressed in the cerebellum of all three 
species. Similarly, the gastrin/cholecystokinin 
type B receptor (CCKBR) is expressed in the hu- 
man cerebellum, whereas the transcripts for the 
ligands (cholecystokinin and/or gastrin) could 
not be detected in this part of the brain. The 
ligands may have an extracerebellar origin. 

We subsequently analyzed the genes coding 
for the y-aminobutyric acid type A (GABA,) 
and nicotinic receptor subunits (nACHRs) 
(Fig. 5, E and F). Both receptor types can 
form ligand-gated ion channels with differ- 
ent physiological properties by variations in 
the subunit compositions. Many of the studies 
related to these receptors have been performed 
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in mice and rats, and it is therefore also rele- 
vant to compare the expression of the various 
GABA, and nACHRs in humans and pigs. Vari- 
ation in subunit composition between brain 
regions has been reported (44), and it is known 
that the «6 (GABRA6) subunit, which has the 
highest potency for GABA, is only expressed 
in the cerebellum, unlike receptors containing 
the a1 and a2 subunits, which are more wide- 
spread throughout the brain. Carriers of a 
variant allele of the GABRAG gene have an in- 
creased risk for suicide (45), suggesting that 
the cerebellum might be implicated in men- 
tal disorders. 

Comparing the expression of the GABA re- 
ceptors in the three species suggests a con- 
served subunit composition, as exemplified by 
the subunit distribution pattern in cerebral 
cortex and cerebellum (Fig. 5E). In contrast, 
comparing the relative and absolute expres- 
sion of nACHR subunits reveals a high degree 
of phylogenetic differences between mouse, 
human, and pig (Fig. 5F). For example, pigs 
express low levels of neuronal acetylcholine 
receptor subunit a4 (CHRNA4) but higher 
levels of subunits «1 (CHRNAI), «3 (CHRNA3), 
and o6 (CHRNA6) in the cerebral cortex and 
basal ganglia compared with mouse and hu- 
man. Furthermore, both human and pig ex- 
press higher levels of the B1 subunit (CHRNB1) 
compared with mice, and pigs express the 
€ subunit (CHRNE) in the basal ganglia only. 
The £1, a1, and € subunits are all considered to 
be exclusive mammalian muscular nicotinic 
receptor subunits. The results presented here 
therefore suggest alternative nicotinic recep- 
tor composition in different mammalian spe- 
cies and the possibility of “muscular” nicotinic 
receptor subunits involved in the formation of 
functional nicotinic receptors in the central 
nervous system (CNS). These examples con- 
firm the shared basic architecture of the 
mammalian brain with regard to cell types, 
neurotransmitter systems, and physiological 
functions. However, we identified examples 
of clear species variation in the expression lev- 
els and distribution of receptors that suggest 
an important role for receptors in brain evo- 
lution and adaptation. This reinforces the 
need for caution when comparing the function 
of, for examples, serotonergic, opioid, and cho- 
linergic receptors on the basis of animal ex- 
periments using rodents without considering 
the expression pattern of the ortholog receptor 
in humans. This is particularly important in 
the context of drug development, given that at 
least 30% of today’s prescribed drugs act via 
GPCRs (46). 


GPCRs with unknown functions 


We next analyzed the expression profiles of all 
GPCRs to explore differences and similarities 
in expression pattern across the brain regions 
in the three mammalian species (fig. S30). 


7 of 16 


RESEARCH | RESEARCH ARTICLE 


These GPCRs include many olfactory recep- 
tors with highest mRNA levels in the olfac- 
tory bulb, possibly located in the axons of the 
olfactory receptor cells. In addition to the 
GPCRs associated with olfaction and neuro- 
transmission, we identified several human 
orphan GPCRs with brain-elevated expression 
(n = 30 genes). These include the GPR37L1 
and GPR162 expressed in many brain regions 
(Fig. 5G), such as the cerebellum (Fig. 5H), 
but also several brain-elevated orphan GPCRs 
with regionally elevated expression, including 
GPRCS5C, mainly expressed in the cerebellum 
in all three species. We confirm the exclusive 
expression of GPR88 in the basal ganglia (47) 
of human, pig, and mouse, but we were also 
able to identify four brain orphan GPCRs with 
elevated expression in the basal ganglia (Fig. 
5G). The nonvisual photoreceptor melanopsin 
(OPN4) shows high expression in the human 
basal ganglia but very low expression in the 
brains of pig and mouse. OPN4 is expressed by 
ganglion cells in the retina and plays a role 
in the regulation of circadian rhythms (48), 
although the function of this photoreceptor 
deep in the “dark” core of the brain is unclear. 
It was recently reported that melanopsin also 
acts as a thermoreceptor mediating heat- 
activated expression of clock genes (49). Our 
data, based on the expression pattern found 
here, suggest a possible temperature-sensing 
role of OPN4 in the human basal ganglia that 
is not shared with mice or pigs. 


Whole-body versus brain regional tissue 
specificity classification 

The data presented here have made it possible 
to compare the brain enrichment of all genes 
with the whole-body tissue specificity using 
the Tissue Atlas resource (7). The NX data 
across all samples were used to classify all 
protein-coding genes according to organ and 
tissue expression, where the brain was clas- 
sified as a single organ, and 36 additional or- 
gans and tissues were scored across the human 
body. These tissue types include, for example, 
liver, pancreas, intestine, lung, reproductive 
organs, and lymphoid tissues, as well as a group 
of cell types summarized as “blood,” including 
18 single blood cell types and peripheral blood 
mononuclear cells (50). For the brain, the max- 
imum NX value for a given gene in one of the 
brain regions was used as the brain expression 
value. We previously reported 1113 genes with 
elevated expression in the brain on the basis of 
the comparison of the cerebral cortex with 26 pe- 
ripheral tissue types (/, 57). Here, we analyzed 
many more brain regions as well as spinal cord 
and corpus callosum, and we identified many 
more genes (n = 2587) with elevated expression 
in at least one region of the brain compared 
with peripheral tissues (fig. S31). In addition, 
5298 genes were found to be expressed in the 
brain but had elevated expression levels in a 
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peripheral organ. A total of 8342 genes showed 
low tissue specificity across all 37 tissues and 
organs (fig. S31 and table $7). Only 33 genes 
could be specifically defined as enriched in 
the brain and not detected in any of the pe- 
ripheral tissues. Many of these “specific” genes 
were transcription factors, such as neurogen- 
ic differentiaton factors 2 and 6 (NEUROD2 
and NEUROD6), BarH-like 1 homeobox pro- 
tein (BARHL1), and GPCRs such as GPR101 
and GPR26. 

We analyzed the expression levels of the 
2587 human genes classified as brain-elevated 
across all analyzed human peripheral tissues 
(fig. S32A). The analysis demonstrates two 
major clusters: the first with relatively restricted 
expression across the peripheral tissues, and the 
second containing genes with a more tissue- 
wide expression. An analysis of the first cluster 
shows smaller and more specific expression 
clusters, such as a subcluster of genes with 
expression in testis and fallopian tube, in ad- 
dition to the brain. Most of these genes en- 
code proteins specifically expressed in ciliated 
cells, including ependymal cells lining the 
ventricular wall in the brain (fig. S32B). Other 
notable subclusters harbor genes with elevated 
expression in the brain but also high expres- 
sion in peripheral tissues, such as cardiac and 
skeletal muscles (fig. S32C) and liver (fig. S32D). 
The large cluster of genes with elevated ex- 
pression in the brain, but with a more general 
tissue-wide expression pattern, includes a small 
cluster of genes encoding immune tissue- 
associated proteins (fig. S32, E and F). 

The relationship between the whole-body 
specificity and the regional brain specificity 
was then analyzed for all protein-coding genes. 
In Fig. 6, all genes detected in any of the hu- 
man tissues and organs are included with a 
gene-to-gene comparison to the regional brain 
specificity. Only 520 genes that are classified as 
brain-elevated have regional brain specificity 
expression, while a large fraction of the brain- 
elevated genes (7 = 1776) have low regional 
specificity within the brain. The latter include 
(i) several well-known astrocyte markers, in- 
cluding GFAP and aquaporin 4 (AQP4); (ii) 
oligodendrocyte genes involved in myelination, 
including myelin basic protein (MBP) and 
proteolipid protein 1 (PLP1); and (iii) pan- 
neuronal genes expressed by most neurons, 
for example, sodium/potassium-transporting 
adenosine triphosphatase subunit alpha-3 
(ATP1A3). The 520 genes classified as both 
regionally and brain-elevated include genes 
known to be expressed by different neuronal 
populations and genes involved in inter- and 
intracellular signaling cascades, such as (i) re- 
ceptors, e.g., adenosine receptor 2A (ADORA2A) 
enriched in the basal ganglia; (ii) ion channels, 
e.g., calcium voltage-gated channel auxiliary 
subunit gamma 3 (CALCNG3) elevated in re- 
gions of the cerebrum; and (iii) components 
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of intracellular signaling pathways, such as 
GTPases, e.g., Rho guanine nucleotide ex- 
change factor 33 (ARHGEF33) enriched in 
the cerebellum. 

Many of the genes that have regional brain 
specificity expression are not brain-elevated 
from a whole-body perspective but instead 
have elevated expression in one or a group of 
peripheral tissue types. For example, ankyrin-1 
(ANK1), with expression enriched in skeletal 
muscle and tongue, is selectively expressed in 
the cerebellum and, on the protein level, asso- 
ciated with the membrane of Purkinje cells 
(Fig. 6). However, most of the genes classified 
as elevated in tissue types other than brain are 
classified as having low regional specificity 
within the brain, such as a number of proteins 
detected in astrocytes or oligodendrocytes. 
These proteins include crystallin alpha B 
(CRYAB) and aldehyde dehydrogenase 6 fam- 
ily member Al (ALDH6A1), as well as many 
of the microglia proteins, such as the well- 
characterized allograft inflammatory factor 1 
(AIF1) and the neuropil-associated protein 
regulatory factor X2 (RFX2) elevated in testis. 
The group of genes (n = 8027) classified as 
both low tissue specific and low regional 
specific in the brain include many house- 
keeping proteins but also proteins with a 
more selective location to certain cell types, 
such as Acyl-coenzyme A (CoA) synthetase 
long chain family member 4 (ACSL4) mainly 
detected in neuronal cell bodies, and A-kinase 
anchoring protein 17A (AKAP17A) detected in 
the nucleus of glial cells and neurons in the 
cerebellar granular layer (Fig. 6). 


Global and regional expression landscape of 
cortical cell type signature genes 


A large number of differentially expressed 
genes have previously been identified using 
various approaches, including single-cell ge- 
nomics and coexpression analysis. Here, we 
have analyzed the whole-body expression pat- 
tern of a consensus set of signature genes for 
cortical neurons, astrocytes, oligodendrocytes, 
and microglia using an immunopanning ap- 
proach (8) and a coexpression analysis of pub- 
licly available expression data (72). From these 
two datasets, 420 genes were identified as 
putative human cerebral cortex cell type sig- 
nature genes (listed in table S9), with 196 
neuron-specific, 180 astrocyte-specific, 65 
oligodendrocyte-specific, and 51 microglia- 
specific genes. Analyzing the expression vari- 
ance of these cell type signature genes across 
the different regions of the brain showed mul- 
tiple outliers differentially expressed in differ- 
ent parts of the brain for both neuronal and 
astrocyte genes (Fig. 7A). Oligodendrocytes and 
microglia signature genes are less variable 
across the different brain regions. A large frac- 
tion of neuronal and oligodendrocyte genes are 
classified as brain-enriched (Fig. 7B, table S10, 


8 of 16 


RESEARCH | RESEARCH ARTICLE 


Tissue specificity 


Testis 


@ fees: r =I f 
8 i rs) 
5 we 
z= iW 
oO 

pat x 
® a ain 

zs ¢ 
xr eared 


ATP1A3 


Heart muscle || Heart muscl 
CRYAB 


RFX2 


Lymph node 
AIF1 


ACSL4 


Lymph node 


Fig. 6. The regional expression in brain compared with whole-body 
expression. Gene classification based on tissue specificity using transcript 
expression data in 37 different tissue types enables separation of genes 

into brain-elevated, elevated in tissues other than brain, and low tissue 
specificity. This is compared to the classification based on regional expression 
in the brain. Of 1059 genes, 520 with regionally elevated expression were 
also classified as brain-elevated. Genes classified as elevated in tissues other 
than the brain are often detected in brain but expressed with low regional 
specificity. Rho guanine nucleotide exchange factor 33 (ARHGEF33) is brain- 
enriched, including cerebellum-enriched, and here detected in Purkinje cells 
(HPAO41051). ATPase Na*/K* transporting subunit alpha 3 (ATPIA3) is 
group-enriched in brain and heart muscle and detected in all brain regions 
with low regional specificity. Immunohistochemistry using HPAO56446 
displayed the intercalated discs in heart muscle and had a synaptic location 
in brain. Crystallin alpha B (CRYAB) is tissue-enhanced in striated muscle, 
found in all brain regions with low tissue specificity, and detected in 
oligodendrocytes (HPA057100). Regulatory factor X2 (RFX2) is elevated in 
testis and detected in all brain regions with low regional specificity, and with a 
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nuclear localization in testis and neuropil in brain (HPAO48969). AIF1 is 
enhanced in blood and lymphoid tissues, while also detected in microglia in 
all brain regions with low regional specificity (HPAO49234). ACSL4 is classified 
as low specificity both in tissues as well as brain regions (HPAQ05552). 
ADORA2A is group-enriched in lymphoid tissues and brain, including basal 
ganglia—enriched (HPAO75997). Calcium voltage-gated channel auxiliary 
subunit gamma 3 (CACNG3) is brain-enriched and group-enriched in cerebrum 
regions but was not detected in cerebellum, also verified at the protein level 
(HPAO77238). ANK1 is group-enriched in skeletal muscle and tongue as well as 
cerebellum-enhanced in brain, where the protein is selectively associated 
with the Purkinje cell membrane (HPA004842). GFAP is brain-enriched and 
detected in all brain regions with low regional specificity (HPAO56030). 
ALDH6A1 is group-enriched in kidney and liver, detected in all brain regions with 
low regional specificity, and, as GFAP, localized to astrocytes (HPA029074). 
AKAPI7A is expressed in all tissue types with low tissue specificity as well as 
low regional specificity in the brain; the protein is detected in subsets of 

cell nuclei and in brain in glial cells as well as granular cells in cerebellum 
(HPA043247). Scale bar, 25 um. 


and fig. S35), while many astrocyte and microglia 
genes are classified as elevated in other tissues. In 
fact, several astrocyte signature genes are highly 
expressed in liver and/or muscle tissue, whereas 
many microglia signature genes are enriched in 
lymphoid tissue, bone marrow, and/or blood. 
In summary, the global analysis reveals that 
most of the putative astrocyte and microglia 
signature genes are highly expressed in selec- 
tive peripheral tissues, often exceeding the ex- 
pression levels in the brain. 

We superimposed the putative cell type sig- 
nature genes on the global tissue expression 
landscape using the data from this study (Fig. 
7C). Expression of only 180 of the human cere- 
bral cortex cell type signature genes (43%) was 
classified as brain-elevated with regard to ex- 
pression, and 158 genes (38%) were classified 
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as elevated in expression in nonbrain tissues 
and the expression of the remaining genes 
(n = 82) classified as low tissue specific. To 
further explore this lack of “brain specificity” 
of many of the putative brain signature genes, 
we analyzed some of these genes further, as 
shown in Fig. 7D. The microglia signature 
genes arachidonate 5-lipoxygenase (ALOX5) 
and integrin subunit beta 2 ITGB2) both showed 
elevated expression in blood dymphoid tissues), 
while other microglia signature genes showed 
elevated expression in specific blood cells. For 
example, the gene for PYD and CARD domain 
containing protein (PYCARD) is expressed by 
granulocytes, monocytes, and dendritic cells. 
These results confirm the notion of shared 
origin and functions between microglia and 
immune cells. In contrast, several astrocyte sig- 
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nature genes are classified as genes with ele- 
vated expression in liver or muscle tissue. Out 
of the 19 genes with liver-elevated expression, 
six are transport-related genes, including solute 
carrier family 13 member 5 (SLC13A5), detected 
in the membrane of hepatocytes and end feet 
of astrocytes in brain, and nine genes code for 
metabolic enzymes such as aldehyde dehy- 
drogenase 1 family member L1 (ALDHI1L1), 
detected in cytoplasm of hepatocytes and as- 
trocytes. Genes classified as having muscle- 
elevated expression include several proteins 
with structural function, such as syntrophin 
alpha 1 (SNTA1), detected both in astrocytes 
and skeletal muscle. All three are thus classified 
as showing elevated expression in tissues other 
than brain on the tissue level. These results 
highlight the shared function of astrocytes 
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Fig. 7. The gene expression landscape of 420 putative brain cell type signature 
genes in cerebral cortex. (A) The expression variation for the 420 signature 
genes in the different brain regions was analyzed on the basis of the coefficient 

of variation (CV) of NX values across the 10 regions (see figs. S33 and S34 

for expression heatmap and scatterplot across the 10 regions). (B) The expression 
levels for the individual signature genes in the different tissue types. The most 
abundant genes are indicated by gene names (see fig. S32 for expression 
heatmap of the signature genes across all tissues). (©) Chord diagram shows 
the tissue specificity of brain cell type signature genes. Each link represents the 
number of cell type signature genes that are elevated at a tissue level in the 
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analyzed organ and tissue types. (D) Examples of signature genes and their 
protein localization in the different cell types. Microglia signature genes 
(ALOX5, ITGB2, and PYCARD) are highly expressed in spleen. Several astrocyte 
signature genes, including SLC13A5 and ALDHILI, have elevated expression in 
liver, whereas SNTAI is highly expressed in skeletal muscle. Neuronal examples 
(KCNAB2, KCNB1, and PDE1B) are located in neuronal cell bodies in the brain, 
while oligodendrocyte proteins, including MAG, are detected in white matter of 
the brain as well as in smooth muscle (PPP1RI14A) and renal tubule of kidney 
(ENPP6). More detailed expression profiles for all cell signature genes are shown 
in figs. S33 to S35. Scale bars, 25 um. 
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and hepatocytes in the metabolism of secreted 
substrates. 

We subsequently analyzed the signature 
genes for oligodendrocytes. Expression of 
some putative oligodendrocyte signature genes 
is here classified to have low tissue specificity, 
such as protein phosphatase 1 regulatory in- 
hibitor subunit 14A (PPP1R14A). Others are 
classified as having elevated expression in 
peripheral tissues, such as ectonucleotide 
pyrophosphatase 6 (ENPP6), which is elevated 
in kidney. The expression of putative neuronal 
signature genes is here mainly classified as 
brain-elevated, such as potassium voltage- 
gated channel subfamily A regulatory beta 
subunit 2 (KCNAB2), potassium voltage-gated 
channel subfamily B member 1 (KCNBI1), and 
phosphodiesterase 1B (PDE1B), all detected on 
the protein level, using antibody-based local- 
ization, in a subset of neurons and with dif- 
ferent subcellular locations (Fig. '7D). However, 
several neuronal signature genes also showed 
elevated expression in endocrine tissues, the 
pituitary gland, or male tissues. The analysis 
shows that caution should be taken regarding 
genes identified as signature genes, because the 
identification of these is context-dependent, and 
many of the genes previously identified as signa- 
ture genes for specific cell types in the brain are 
in fact also highly expressed in peripheral tissues. 


The HPA Brain Atlas 


As part of this work, a brain atlas database has 
been launched to present and integrate all the 
data reported here, and this is an extension of 
the HPA portal, with a brain-centric summary 
page for each gene with expression data in 
human, pig, and mouse brain regions. The 
resource is presented (77) with an expression 
summary for all protein-coding genes. For 
selected genes, the distribution of the corre- 
sponding protein is visualized by antibody- 
based protein detection. Gene expression in 
the CNS is visually summarized on the basis 
of the 10 brain regions as well as spinal cord, 
corpus callosum, retina, and pituitary gland, 
with underlying data for many more subre- 
gions (21 subregions in human, 27 in pig, and 16 
in mouse). Each of the 10 regions can also be 
reviewed in individual pages, which provide a 
classification overview, interactive lists, and 
figures, as well as highlighted examples of re- 
gionally specialized cells and proteins. A selec- 
tion of 815 proteins is shown at the protein 
level in human tissues, and 271 genes include a 
complete mouse brain profile through a high- 
resolution virtual microscope. These 271 genes 
were analyzed with immunofluorescence-based 
imaging and include protein expression lev- 
els for 120 subregions of the brain. 


Outlook 


The expression profiles for the protein-coding 
genes in all major brain regions have been 
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determined to capture the complexity of the 
cellular organization of the brain and to en- 
able comparison between species. The integra- 
tion of data from several sources has allowed 
us to combine data from transcriptomics, 
single-cell genomics, in situ hybridization, 
and antibody-based protein profiling. The 
rapid technological improvements in the 
field of spatial transcriptomics and single- 
cell genomics will in the future allow for an 
even higher degree of molecular granular- 
ity. The analysis presented here, relying on 
anatomical dissection of the different regions 
of the brain, allowed us to classify all of the 
individual protein-coding genes on a genome- 
wide level, where each gene is scored for its 
regional distribution. The resource provides 
detailed molecular transcriptomics maps of 
the mouse, pig, and human brains, and these 
maps are combined with immunofluorescence- 
based imaging of single cells using antibodies 
toward proteins identified as being of neuro- 
logical and neuropsychiatric interest. In this 
manner, genes could be identified that are 
shown to be differentially expressed between 
organs and within the brain. By including 
more brain regions, the number of transcripts 
detected in the human brain has increased 
compared with previous studies. The number 
of regionally elevated genes in all three species 
is relatively small, with ~1000 genes identified 
in each species with an elevated expression 
(regional enriched, group enriched, or regional 
enhanced) across the 10 brain regions. 
Analysis of the regionally elevated genes in 
the three species presented here supports the 
concept of a similar basic molecular brain ar- 
chitecture during mammalian evolution. The 
genes involved in production, vesicular transport, 
uptake, and degradation of the main neuro- 
transmitter systems show overall high similar- 
ity among the three species, although notable 
differences have been identified. Thus, for ex- 
ample, some of the catecholamine-synthesizing 
enzymes show distinct species differences with 
regard to localization and expression levels, 
and several metabotropic and ionotropic recep- 
tors also exhibit species differences. Many 
neurotransmitter receptors, in particular the 
nicotinic and opioid receptors, show high var- 
iability in the different species, in particular 
between human and mouse. These types of 
gene differences between species highlight the 
fact that mouse models may not provide data 
that can be used to understand and treat hu- 
man mental disorders. For some of the brain 
regions, such as cerebellum and hypothala- 
mus, the global expression profile of pig is 
closer to that of human, suggesting that pig 
might be an attractive animal model to study 
many neurological and mental processes. 
Many “signature genes” identified previously 
for specific brain cell types (such as astrocytes, 
microglia, oligodendrocytes, and neurons) are 
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expressed at higher levels in peripheral organs, 
demonstrating that caution should be taken 
when using such genes as markers of specific 
brain cell types. In fact, our results support a 
view of shared functions between microglia 
and immune cells, with many genes elevated 
in both types of cells. Similarly, many genes 
previously identified as signature genes for 
astrocytes have a functional role in transport, 
and the elevated expression of these genes in 
astrocytes is often shared with liver or skel- 
etal muscle. Cerebellum stands out with regard 
to the number of regional enriched genes and 
genes differentially expressed between species. 
This is also the brain region with the most dis- 
tinct pattern of active cis regulatory elements 
compared with cortical and subcortical struc- 
tures, and the cerebellum also has the highest 
degree of alteration within predicted enhancers 
among primates (52). Several genes suggested 
to be involved in neuropsychiatric diseases are 
found to be selectively expressed in the human 
cerebellum, which might be surprising for a 
brain region traditionally linked to fine-tuning 
motor behaviors. However, these data support 
the emerging notion that this part of the brain 
is associated with many neurological and psy- 
chiatric conditions. 

We describe an integrative approach for 
mapping the molecular profiles in human, 
pig, and mouse brain that generates a detailed 
multilevel view on the protein-coding genes of 
the mammalian brain. We also compare the 
regional differences of the human brain with 
a genome-wide, whole-body tissue-specificity 
classification. An open-access Human Brain 
Atlas knowledge-based resource is presented 
as part of the HPA to allow the exploration of 
individual genes and classes of genes and their 
expression profiles in the various parts of the 
mammalian brain as well as all other major 
parts of the human body. 


Material and methods 
Animal procedures 


The animal experiments conformed to the Eu- 
ropean Communities Council Directive (86/ 
609/EEC), and all efforts were made to mini- 
mize the suffering and the number of animals 
used. Mouse brain tissue samples used for 
transcriptomic and proteomic analyses were 
collected and handled in accordance with 
Swedish laws and regulations, and all experi- 
ments were approved by the local ethical com- 
mittee (Stockholms Norra Djurforsdksetiska 
Namd N183/14). The experimental minipigs 
(Chinese Bama Minipig) were provided by the 
Peral Lab Animal Sci & Tech Co., Ltd (Permit 
number SYXK2017-0123). Brain tissue samples 
used for analysis were collected and handled 
in accordance with national guidance for large 
experimental animals and under permission of 
the local ethical committee (ethical permission 
numbers 444:10500000078 and BGI-IRB18135) 
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and experiments were conducted in line with 
European directives and regulations. 

Wild-type male (n = 2) and female (n = 2) 
C57BL/6J mice (2 months old) were obtained 
from Charles River Laboratories and main- 
tained under standard conditions on a 12-hour 
day/night cycle, with water and food ad libitum. 
Mice were deeply anesthetized and transcar- 
dially perfused with 0.9% saline solution. Brains 
were quickly removed from the skull and dis- 
sected on a glass plate on ice. The entire brain 
was carefully dissected into 16 subregions, and 
corpus callosum, pituitary gland, and retina 
were also collected. A complete list of samples 
and subregions is provided in table S3. Tissue 
samples were collected into tubes, snap frozen 
in dry ice, and stored at —80°C until further 
processing. 

For immunofluorescence and iDISCO anal- 
ysis, mice were anesthetized and transcardially 
perfused using balanced Tyrode’s solution fol- 
lowed by fixation with modified Zamboni fix- 
ative (4% paraformaldehyde, 0.2% picric acid 
in 0.1 M phosphate buffer). For cryosectioning, 
brains were post-fixed for 90 min and trans- 
ferred to PBS containing 30% sucrose and 0.1% 
sodium azide. After cryopreservation, brains 
were snap frozen using CO, and 16-um-thick 
coronal sections were cut on a cryostat (Leica, 
CM1950) and thaw-mounted on SuperFrost 
Plus glass slides (VWR). For iDISCO experi- 
ments, samples were placed in PBS contain- 
ing 0.1% sodium azide until further processing. 

Male (n = 2) and female (m = 2) Chinese 
Bama minipigs (1 year old), were obtained 
from the Pearl Lab Animal Sci & Tech Co., 
Ltd. All animals were housed in a specific 
pathogen-free stable facility under standard 
conditions. Pigs were deeply anesthetized and 
slaughtered by terminal bleeding. The entire 
pig brain was quickly removed from the scull 
and submerged into ice-cold PBS buffer for 
2 min to remove excess blood and stiffen the 
tissue. The brain was cut in coronal slabs at 
the level of (i) frontal lobe/olfactory tract, (ii) 
optic chiasm, and (iii) between hypothalamus 
and cerebral peduncle. Slaps were divided in 
two hemispheres, exposing all main brain 
structures. Sample blocks of one hemisphere 
were immersion-fixed in phosphate-buffered 
saline containing 4% paraformaldehyde for 
1 week followed by storage in phosphate- 
buffered saline containing 0.1% sodium azide 
at 4°C. For mRNA analysis, pieces of cerebral 
cortex and cerebellum were collected on the 
basis of a sampling strategy collecting a rep- 
resentative sample containing all cell layers. 
All other regions were dissected and collected 
in their entirety, subregional samples are listed 
in table S2. Two samples (somatosensory cor- 
tex and periaqueductal gray) are missing from 
female 1, as these two regions could not be 
identified with 100% certainty and thus were 
excluded. Duplicate samples were taken from 
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olfactory bulb from female 2, resulting in 119 
brain samples and an additional 8 samples 
(retina and pituitary gland), for a total of 127 
samples. All samples were stored at —80°C until 
RNA extraction took place, within one month. 

For immunofluorescence analysis, samples 
were immersed in 70% ethanol before dehy- 
dration in absolute alcohol and xylene before 
paraffin embedding. Sections were cut (4 um 
by Microm HM 3558S, Thermo Fisher Scientific) 
and placed on SuperFrost Plus glass slides 
(VWR), baked, and then used for staining or 
stored in —20°C until stained. 


RNA sequencing of pig and mouse 
brain samples 


For mouse brain RNA extraction, the tissue 
was homogenized mechanically using a Tissue- 
Lyser LT (Qiagen) and total RNA was prepared 
using the RNeasy Mini isolation kit (Qiagen) 
for each of the 19 samples. This generated high- 
quality RNA, with 84% of the samples having 
RNA integrity values >8.0, with only one sam- 
ple removed owing to very low RIN value (<6.0). 
RNA integrity (RIN) was assessed using Agilent 
RNA 6000 Nano Kit (Agilent Technologies). 
In total, 75 samples were subsequently used 
for library construction with Illumina TruSeq 
Stranded mRNA reagents. The Illumina 
HiSeq2500 platform was used for sequencing 
at ~20 million reads depth. Detailed informa- 
tion about the samples and sequencing qual- 
ity control is listed in table S11. The output 
analysis was performed using Kallisto v.0.43.1 
and mapped to the mouse Ensembl v92 with 
22,333 protein-coding genes, for the initial 
analysis. Human and mouse orthologs were 
defined as a one-to-one translation, resulting 
in a total of 15,160 genes. 

For pig brain RNA extraction, the tissue was 
homogenized mechanically using a Dounce 
tissue grinder in liquid nitrogen. Total RNA 
was then extracted with a standardized proto- 
col based on TRIzol reagent (Invitrogen). First, 
total mRNA and noncoding RNAs were en- 
riched by removing ribosomal RNA (rRNA) 
using a MGIEasy rRNA depletion kit (MGI 
Tech, China). Enriched RNAs were then mixed 
with RNA fragmentation buffer resulting in 
short fragments (180 to 300 base pairs). Third, 
complementary DNA (cDNA) was synthesized 
from the fragmentated RNAs using N6 random 
primers, followed by end repair and ligation to 
BGISEQ sequencer compatible adapters. The 
quality and quantity of the cDNA libraries were 
assessed using Agilent 2100 BioAnalyzer (Agi- 
lent Technologies). Finally, the libraries were 
sequenced on the BGISEQ-500 with 100 paired- 
end read (PE100). A few randomly selected lib- 
raries were also resequenced and co-validated 
with MGI2000 sequencer. An average of 200 mil- 
lion reads were generated for each library. Se- 
quencing reads that contained adapters and/or 
had low quality, aligned to rRNA were filtered 
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before following bioinformatic analysis. An 
overview of the total reads, Q30 clean reads, and 
mapping ratio to the pig genome (Sscrofal1.1) 
is provided in table S12. More than 94% of the 
samples have <10% rRNA of total reads, indi- 
cating a highly efficient rRNA removal and 
RNA quality. One sample (pituitary gland 
from female 2) was excluded from final data 
analyzed because of high rRNA inclusion (table 
$12). The output analysis was performed using 
Kallisto v.0.43.1 and mapped to the pig En- 
sembl build 92 with 22,342 protein-coding 
genes, for the initial analysis. Human and pig 
orthologs were defined as a one-to-one trans- 
lation, resulting in a total of 14,656 genes. 


Human sequencing datasets 


The Functional Annotation of Mammalian 
Genomes 5 (FANTOMS) project (19) provides 
transcriptomic profiles and functional anno- 
tation of mammalian cell types using cap 
analysis of gene expression (CAGE) (53), a 
method developed at RIKEN that is based 
on several full-length cDNA technologies. 
Expression data files with CAGE peaks and 
ontology for 77 samples (representing 30 dif- 
ferent tissue types) were obtained from the 
version 4 FANTOM5 repository (https://fantom. 
gsc.riken.jp/5/datafiles/reprocessed/), which we 
mapped to Ensembl for calculation of the 
normalized tags per million for each gene. The 
Genotype-Tissue Expression (GTEx) (78) is an 
extensive project that has collected and ana- 
lyzed thousands of human postmortem tissue 
samples. RNA-seq data from 26 tissue types 
Gncluding more than 8000 patient samples) 
were mapped using RSEMVvI1.2.22 (v7,GTEx_ 
Analysis_2016-01-15_v7_RSEMv1.2.22_tran- 
script.tpm.txt.gz) and generated transcript per 
million (TPM) values that are included in the 
Human Protein Atlas. The in-house RNA-seq 
analysis on human tissue types includes 172 tis- 
sue samples covering 33 of the 37 tissue types 
representing the whole human body. The de- 
tailed protocol used for RNA-sequencing in the 
HPA has been described previously (J, 57). 


Normalization of human data 


To enable expression classification and map- 
ping of all human protein-coding genes across 
all tissue types and samples, TPM expression 
values were obtained by mapping processed 
human reads to the human reference genome 
GRCh37/hg19 based on Ensembl build 92 (54) 
gene models using Kallisto (v.0.43.1) (55). Next, 
the gene expression levels were calculated by 
summing up the TPM values of all alterna- 
tively spliced protein coding transcripts for 
the corresponding gene for a total of 19,670 
protein-coding genes. The average TPM value 
of all individual samples for each tissue, brain 
region, or cell type was used to estimate the 
gene expression level. Data analysis and vis- 
ualization were performed using R (version 
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3.5.1 Feather Spray) (56). To allow the three 
datasets (HPA, GTEx, and FANTOM) to be 
combined (J, 18, 19), a pipeline was set up to 
normalize the data for all samples (fig. S4). 
In brief, we first scaled all TPM values per 
sample so that the sum was one million, to 
compensate for the noncoding transcripts 
that had been previously removed and to ob- 
tain pTPM values per sample. Next, all TPM 
values were TMM normalized (22) between all 
the samples in each data source (HPA tissues, 
HPA blood cells, GTEx, and FANTOMS, respec- 
tively), then each gene was Pareto scaled (23) 
within each data source. Tissue data from 
multiple sources were integrated using batch 
correction implemented as removeBatchEffect 
in the R package limma (24) using the data 
source as a batch parameter. The resulting 
transcript expression values, here called nor- 
malized expression (NX), are calculated for 
each gene in every sample. In the Human 
Protein Atlas, the NX value for every gene in 
every sample is calculated and visualized on 
the gene summary page together with the 
pTPM value. The expression classification 
across the 37 tissue types included four tis- 
sues with combined data: brain, intestine, lym- 
phoid tissues, and blood cells, all represented 
by the maximum NX value within each group. 
In general, tissues, cells, or regions including 
multiple data sources or multiple subtissues 
were all represented by a consensus NX value, 
calculated for each gene as the maximum NX 
value in the subtissues/regions or cell types. 


Normalization of pig and mouse data 


All TPM values of pig and mouse datasets were 
TMM normalized (22) between all samples, re- 
spectively, and then each gene was Pareto 
scaled (23) within each species (fig. S4). NX 
for each gene was calculated in every sample 
as described for human, including calculation 
of pTPM values. In the HPA, the pTPM value 
for every gene in every sample is visualized on 
the gene summary page and the more de- 
tailed tissue pages. For regions containing mul- 
tiple subregions, a consensus NX value was 
calculated for each gene as the maximum NX 
value of the subregions (Fig. 1B). 


Comparisons of three species 


Protein-coding genes with one-to-one orthologs 
in human, mouse, and pig were identified to 
compare the expression profiles in the three 
mammalian brains, and altogether 12,999 genes 
were analyzed (fig. SI9A). All NX values of 
the 12,999 genes were then TMM normalized 
(22) between 10 brain regions in three species 
(figs. S4 to S6). 


Classification based on RNA expression 


All protein-coding genes were classified accord- 
ing to a new strategy based on categorization 
on both tissue specificity (expression abun- 
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dance between tissues, table S4) and tissue 
distribution (detection level above cutoff NX =1, 
table S8). Tissue specificity highlights genes 
with elevated expression in one or a group of 
tissue types compared with the rest, with the 
three elevated categories being “enriched” 
(fourfold higher expression in one tissue com- 
pared with the second highest), “group en- 
riched” (fourfold higher expression in a group 
of tissues compared with other tissues), and 
“enhanced” (fourfold higher expression in one 
or several tissues compared with the mean of 
all tissues) (table S4). These classification rules 
were applied to the expression profiles of the 37 
tissue types representing the whole human 
body as well as the different brain regions in 
human, pig, and mouse (Fig. 2). The tissue 
distribution defines the number of tissues 
with expression levels above cutoff (NX = 1) 
(table S8). The combination of tissue speci- 
ficity and distribution from a brain perspec- 
tive (genes detected in brain distributed into 
the different categories) is shown in table $7. 
Tissue-based classification, highlighting the 
brain-elevated genes compared with periph- 
eral tissues, is available for all human protein- 
coding genes, while the regional classification 
in human brain is limited by the availabil- 
ity of external expression data (GTEx and 
FANTOM) (Fig. 1C and fig. $5 for more details 
about the gene coverage and combinations of 
the datasets). A second step of normalization 
was introduced to enable comparison of the 
expression levels across species. All human 
protein-coding genes with one-to-one ortho- 
logs in both mouse and pig (12,999 genes) 
were adjusted by TMM normalization, as il- 
lustrated in the schematic overview of the nor- 
malization pipeline, fig. S4. 


Hierarchical clustering and UMAP analysis 


Clustering in heatmaps and dendrograms based 
on Spearman correlation were created by first 
calculating a correlation matrix of Spearman’s p 
(57) between all brain regions. The correlation 
was converted to a distance metric (1 - p) and 
was clustered using unsupervised top-down 
hierarchical clustering, where, at each stage, 
the distances between clusters are recomputed 
by the Lance-Williams dissimilarity update 
formula according to average linkage. Den- 
drograms showing gene expression in heat- 
maps have been clustered using the Ward2 
algorithm (58), an implementation of Ward’s 
minimum variance method (59) implemented 
as “Ward.D2” in the hclust function in the R 
package stats, where clusters are chosen at 
each stage such that the increase in cluster 
variance is minimized after merging. The hi- 
erarchical clustering of brain regions in three 
species was conducted by using the neighbor- 
joining approach in the ape package (60) in 
R, based on pairwise Pearson correlational 
distances between samples. The reliability of 
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branches was assessed using 100 bootstrap 
replicates. The phylogenetic tree was drawn 
using the plot.phylo function in ape. Uniform 
Manifold Approximation and Projection (UMAP) 
has been performed on NX values of brain sam- 
ples by using the R packages UMAP (67) with 
default parameters. 


Differential expression analysis of three species 


Differential expression analysis was conducted 
by using normalized NX values of 10 regions of 
three species. The R package limma, which in- 
cludes ImFit, eBayes, and topTable functions, 
was used for pairwise comparison of DEGs. 
False discovery rate (FDR) was calculated 
by using p.adjust() function in R, using the 
Benjamini-Hochberg method. Genes with FDRs 
less than 0.01 and absolute fold change larger 
than 2 were considered as differentially ex- 
pressed genes. 


Defining cell type signature genes 


Human cerebral cortex signature genes for 
neurons, astrocytes, oligodendrocytes, and 
microglia were determined on the basis of 
the agreement between two independent (data 
source and approach) datasets. RNA-seq re- 
sults of cells selected using immunopanning 
(8) were obtained from www.brainrnaseq.org, 
and results based on coexpression analysis 
(12) were obtained from http://oldhamlab. 
ctec.ucsf.edu/. By varying the inclusion crite- 
ria for RNA-seq data (fold-enrichment >2 to 
>5) and coexpression analysis (p-value 0.95 to 
1) the optimal settings creating the maximum 
overlap between these datasets for each cell 
type were determined (Table 1). Human ce- 
rebral cortex cell type signature genes were 
defined as genes associated with the same cell 
type based on both datasets with an FPKM 
value of >1 in only one cell type based on RNA- 
seq. The list of 420 genes, here defined as cell 
type signature genes, are listed in table S9. 


Antibody-based profiling of protein distribution 


Protein profiling in human brain tissues was 
performed within the Human Protein Atlas 
pipeline, following previously published pro- 
tocols (7). Formalin fixed paraffin embedded 
(FFPE) tissue samples were used for tissue 
microarray (TMA) construction, where 144 sep- 
arate 1-mm cores were placed in a recipient 
paraffin block (62) representing 44 different 
tissue types. Sections were cut (4 um by Microm 
HM 3558S, Thermo Fisher Scientific) and placed 
on SuperFrost Plus glass slides (VWR). The 
sections were dewaxed, H,O,-incubated, and 
antigen retrieved by heat-induced epitope re- 
trieval (HIER) in pH6 citric acid solution be- 
fore commencing the staining procedure. The 
Leica Biosystems CV5030 immunostainer was 
used for pretreatment as well as in later steps 
of counterstaining and coverslipping. Staining 
protocols were standardized and executed in a 
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Table 1. Criteria used for best overlap in two independent datasets, defining the cell type signature genes. Percentages in parentheses represent 
maximal overlap between two datasets used for selection of inclusion criteria. 


Cell type (fold) 


Neurons >2 


RNA-seq enrichment 


Coexpression analysis 
(P value) 


>0.98 


Number of genes 
(two datasets) 


312 (32.3%) 


Number of signature genes 


679 (30.0%) 196 


Microglia 


LabVision Autostainer 480 using LabVision 
reagents and protocols with primary antibody 
incubation 30 min in room temperature and 
HRP-polymer secondary antibody and DAB 
(3,3-diaminobenzidine) chromogenic visual- 
ization. All slides were counterstained in 
HTXplus (Histolab) before coverslipping, and 
image digitalization was performed in Scan- 
scope AT2 (Aperio Vista) using a 20x objective. 

Protein profiling in mouse brain tissues was 
performed as described previously (63). Com- 
plete brain profiles were represented by 20 to 
25 sections having a 400-um interval, covering 
all major regions of the mouse brain. Briefly, 
sections were incubated with primary anti- 
body (16 to 72 hours at 4°C), blocked in TNB 
buffer (0.1 M Tris-HCl, pH 7.5; 0.15 M NaCl; 
0.5% blocking reagent) followed by HRP- 
conjugated secondary antibody (Dako), and 
immunoreactivity was visualized using the 
tyramide signal amplification system (TSA- 
Plus; NEN Life Science Products, Inc.). Fluo- 
rescent images were obtained using a “VSlide” 
slide scanning microscope (MetaSystems) 
equipped with a CoolCube 2 camera (12-bit 
grayscale), a 10x objective and filter sets for 
4’,6-diamidino-2-phenylindole (DAPI) (EX350/ 
50-EM470/40), Fluorescein isothiocyanate 
(FITC) (EX493/16—-EM527/30), Cyanine (Cy) 
3 (EX546/10-EM580/30), Cy3.5 (EX581/10- 
EM617/40), and Cy5 (EX630/20-647/long 
pass). Finally, the individual images were 
stitched together (VSlide) to generate a large 
image of the entire section, while the images 
(vsi-files) were additionally extracted to high 
quality jpeg files for further analysis using the 
software Metaviewer (Metasystems). All images 
were manually evaluated and scored, always 
including verification by a second observer. 

The human brain antibody-based chromo- 
gen stainings are all available on the Human 
Protein Atlas portal (www.proteinatlas.org). 
Antibody IDs and antibody dilutions are listed 
in table S13. More details about antibody vali- 
dation and antigen design for respective anti- 
bodies are found on the respective antibody 
information page in the portal. Fluorescent 
mouse brain staining shown in Fig. 3 are 
examples from full mouse protein profiles 
available online, performed according to the 
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protocol described above (list of antibodies in 
table S13). The examples shown for species 
comparison in human, pig, and mouse are all 
performed on FFPE tissue sections (4 um), 
placed on SuperFrost Plus glass slides (VWR), 
baked before dewaxing and antigen retrieved 
according to standard procedure for FFPE 
sections as described previously, including 
H,0,-incubated and HIER in pH 6 citric acid 
solution. The sections were then incubated 
overnight at 4°C with primary antibody (list of 
antibodies and dilution in table S13), followed 
by blocking and secondary HRP-conjugated 
secondary antibody. The staining protocol fol- 
lowed was according to the mouse profiling 
standard using TSA amplification. 


iDISCO+ volume immunostaining 


iDISCO+ volume immunostaining and clearing 
process were performed as earlier described 
by Renier and colleagues (64). Briefly, whole 
mouse brains (one hemisphere was laterally 
slightly trimmed) were washed in 0.01 M PBS 
three times in 5-ml Eppendorf tubes and then 
dehydrated in a series of methanol/water so- 
lutions for 1 hour each. The samples were 
then bleached with 5% hydrogen peroxide in 
100% methanol overnight at +4°C. Then, they 
were rehydrated, incubated in permeabiliza- 
tion solution for 2 days, and followed by block- 
ing solution for an additional 2 days, both at 
37°C (0.2% Triton-X100/20% DMSO/0.3 M 
glycine in 0.01 M PBS + 0.02% sodium azide, 
0.2% Triton-X100/10% DMSO/6% normal 
donkey serum in 0.01 M PBS + 0.02% sodium 
azide, respectively). The samples were then 
incubated with a rabbit polyclonal primary 
antibody raised against human GPRI151 
(HPA065728, 1:150) solution for 7 days at 37°C 
(antibody diluent: 0.2% Tween-20/10 pg/ml 
heparin/5% DMSO/3% normal donkey serum 
in 0.01 M PBS + 0.02% sodium azide). After 
extensive washing, the blocks were incubated 
in secondary antibody (1:150; goat anti-rabbit, 
conjugated with Alexa Fluor 647; Molecular 
Probes, Oregon, USA) solution (0.2% Tween- 
20/10 ug/ml heparin/3% normal donkey serum 
in 0.01 M PBS + 0.02% sodium azide). The 
blocks were dehydrated in methanol/water 
series, incubated in 66% dichloromethane/ 
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135 (19.4%) 


33% methanol for 3 hours and in 100% di- 
chloromethane for 2 x 15 min. Finally, the 
blocks were moved to tubes and stored in 
100% dibenzyl ether for the long term. 

A light sheet microscope (Ultramicroscope 
II, Lavision Biotec, Bielefeld, Germany) and 
the Imspector software were used for image 
acquisition of the whole mouse brains. The mi- 
croscope was equipped with an sCMOS cam- 
era (Andor Neo) and a 2x/0.5 objective lens 
(MVPLAPO 23), with a 6.5-mm working dis- 
tance spherical aberration corrected dip- 
ping cap. 

The trimmed full mouse brain (cut at lateral 
ca —2.5 mm) was fixed on the sample holder 
with the surface of the trimmed hemisphere 
side down and acquired in sagittal position 
from ca. lateral —2.00 until the top of the other 
side cortex (ca 6.5 mm altogether). To obtain 
the required X/Y/Z resolution, homogeneous 
illumination within the entire focal plane and 
minimal photobleaching, the following micro- 
scope parameters were applied: 2x objective, 
1.6x zoom body, and additional magnification 
of the dipping cap lens (altogether 3.6 effec- 
tive magnification), 70% laser power (OBIS 
647 laser), bilateral illumination (blend merging 
algorithm), 100-ms exposure time, max sheet 
numerical aperture (0.149), dynamic horizon- 
tal focus process with 13 steps (blend merging 
algorithm after precalibration process), 70% 
sheet width, 2-um Z-step thickness, mosaic 
acquisition mode (six tiles with 15% overlap). 
Stitching was achieved by the Terastitcher- 
Imspector python script (LaVision Biotec, 2017), 
where the serials of 16-bit uncompressed 
stitched tiff images (ca. 3500 z-levels, ca 100 GB) 
were then converted to IMS file, and the 3D 
vision of acquisitions was reconstructed in the 
Imaris 9.1.2 (Bitplane, UK) software for inspec- 
tion and quality control. 
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CELL SURFACE MAPPING 


Microenvironment mapping via Dexter energy 


transfer on immune cells 


Jacob B. Geri, James V. Oakley’, Tamara Reyes-Robles2*, Tao Wang", Stefan J. McCarver’, 
Cory H. White?, Frances P. Rodriguez-Rivera®, Dann L. Parker Jr.°, Erik C. Hett?, 
Olugbeminiyi 0. Fadeyi*+, Rob C. Oslund*+, David W. C. MacMillan} 


Many disease pathologies can be understood through the elucidation of localized biomolecular networks, 
or microenvironments. To this end, enzymatic proximity labeling platforms are broadly applied for 
mapping the wider spatial relationships in subcellular architectures. However, technologies that 

can map microenvironments with higher precision have long been sought. Here, we describe a 
microenvironment-mapping platform that exploits photocatalytic carbene generation to selectively 
identify protein-protein interactions on cell membranes, an approach we term MicroMap (Map). By 
using a photocatalyst-antibody conjugate to spatially localize carbene generation, we demonstrate 
selective labeling of antibody binding targets and their microenvironment protein neighbors. This 
technique identified the constituent proteins of the programmed-death ligand 1 (PD-L1) 
microenvironment in live lymphocytes and selectively labeled within an immunosynaptic junction. 


patial relationships between biomolecules 

underpin the fundamental processes of 

life. In the context of cell surfaces, it has 

long been established that proteins are 

localized into defined assemblies, termed 
microenvironments (7). These substructures 
play a critical role in intercellular communi- 
cation (2). As such, the capacity to precisely 
map these cellular landscapes should provide 
crucial insights into fundamental biology that 
will have wide-reaching implications for human 
health and the development of therapeutic 
strategies (3). In more specific terms, improved 
methods for microenvironment mapping are 
likely to enable discoveries in areas as wide 
ranging as proteomics (antibody target identi- 
fication and discovering protein-protein inter- 
actions) (4), genomics (profiling biomolecules 
near genetic loci) (5), and neuroscience (eluci- 
dating synapse dynamics) (6). 

In recent years, several platforms have 
emerged that enable the specific labeling of 
proteins by using the concept of spatial proxim- 
ity (7-9). These technologies (APEX, SPPLAT, 
EMARS, and BioID) (JO, 17) use tethered enzymes 
that catalytically generate reactive open-shell or 
electrophilic species that target specific amino 
acid residues in neighboring systems. Because 
these intermediates have extended half-lives 
(0.1 ms to 60 s), they undergo diffusion at rates 
comparable with that of labeling (Fig. 1) (12). 
These elegant strategies have primarily been 
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applied to the proteomic profiling and im- 
aging of larger protein architectures such as 
mitochondria, lipid droplets, and nuclear pores 
(13-15) as well as regions such as synaptic clefts 
(16), lipid rafts (9), nuclear lamina (77, 18), and 
protein clusters (79, 20). 

Given the intrinsic value of understanding bio- 
logical systems at the microenvironment level, 
there remains a demand for cellular mapping 
technologies that operate at short range and 
with high precision. With this in mind, we iden- 
tified a series of requirements for a suitable mi- 
cromapping technology that might be used 
across a range of cellular constructs: (i) a catalytic 
manifold tolerant of aqueous conditions and bio- 
molecules; (ii) a catalyst that can be conjugated 
to arange of targeting modalities—antibodies, 
small-molecule ligands, macromolecules, DNA, 
and sugars (27); (iii) a catalysis mechanism that 
can selectively activate chemical probes at a 
diffusion-limited rate; (iv) a readily accessible 
labeling probe that is only activated within 1 nm 
of the catalyst radius; (v) a labeling probe that is 
sufficiently reactive not to undergo long-range 
diffusion after activation; and (vi) a biomolecule- 
labeling mechanism that is both diffusion- 
limited and residue agnostic. The successful 
realization of these goals could, therefore, be 
applied within spatially restricted environments 
that have to date proven challenging to profile 
with high precision, such as the immunosynapse. 


Design plan 


It has long been established that carbenes 
readily cross-link with C-H bonds found in all 
biomolecules [rate constant () = 3.1 x 10° s“'] 
and cannot diffuse farther than 4 nm owing to 
fast quenching by water [k = 3.1 x 10° s“, half- 
life (T1/2) < 2 ns] (22, 23). This reactivity profile 
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renders carbenes ideal reactive intermediates 
for high-precision microenvironment mapping. 
Although diazirine-based probes have been 
widely applied in small-molecule target identi- 
fication (24), the general requirement for direct 
excitation with ultraviolet (UV) light precludes 
the possibility of a target-localized activation 
of free diazirine substrates by using short- 
wavelength light. By contrast, it is well known 
that diazirines exhibit little to no direct absorp- 
tion of blue light (410 to 490 nm) (25). We re- 
cently questioned whether visible light-powered 
iridium catalysts could sensitize diazirines by 
means of a Dexter energy transfer mechanism. 
This would allow blue light-emitting diodes 
(LEDs) to indirectly activate diazirines, local- 
izing the generation of carbenes to within 0.1 nm 
of the photocatalyst. If that is possible, we 
further recognized the opportunity to apply 
this localized excitation to a high-precision 
platform for microenvironment mapping. More 
specifically, we envisioned a photocatalytic prox- 
imity labeling technology for carbene generation 
in which a spatially targeted iridium complex acts 
as an antenna (Fig. 1, top), absorbing the 
photonic energy of visible light and transferring 
it to a diazirine probe (Fig. 2A). Among numerous 
applications, we hypothesized that this approach 
to MicroMapping (uMap) would offer suffi- 
cient spatiotemporal resolution to profile nano- 
scale protein assemblies on the surface of cells. 

The proposed mechanism of the transfor- 
mation is outlined in Fig. 2A. First, a ground- 
state iridium-based photocatalyst would be 
excited to its S, state through absorption of 
blue (450 nm) light. Quantitative intersystem 
crossing to the long-lived triplet excited state 
(T;) (2 = 0.2 to 3 us) is then followed by 
short-range Dexter energy transfer, in which 
the catalyst is returned to its ground Sg state, 
and diazirine is promoted to its T, state (26). 
The triplet diazirine undergoes elimination of 
Nz to release a triplet carbene, which under- 
goes picosecond-time scale spin equilibration 
(27) to its reactive singlet state (71/2. < Ins) and 
then either cross-links with a nearby protein or 
is quenched by the aqueous environment. We 
identified four key catalyst design elements 
required for successful photocatalytic prox- 
imity labeling: (i) a triplet energy in excess of 
60 kcal/mol, facilitating energy transfer to the 
diazirine substrate; (ii) a visible-light extinction 
coefficient (€429) greater than 1000 M‘ cmt, 
to enable the use of light sources that do not 
promote background diazirine photolysis; (iii) 
hydrophilicity for biocompatible conditions; and 
(iv) a suitable reactive handle for bioconjugation. 


Methodology development 


We first tested the feasibility of visible light- 
sensitized N, elimination from diazirines by 
screening a variety of photocatalysts with 
increasing triplet energies (Fig. 2B). Although 
catalysts with triplet energies below 60 kcal/mol 
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did not sensitize model diazirine 1, a catalyst 
with a triplet energy (Ey) exceeding this thresh- 
old {Ir[dF(CF3)ppy]2(dtbbpy)}PF¢ (2) (Er = 
60.1 kcal/mol) (28) promoted consumption of 
1 under mild conditions (15 min, 25°C, 100 uM 
H,O/dimethyl sulfoxide, 450 nm irradiation) 
in >97% yield. No reaction was observed with 
diazirine 1 in the absence of photocatalyst or 
light. We then redesigned this catalyst for bio- 
molecular applications by increasing its water 
solubility through the addition of polyethylene 
glycol, carboxylic acid, and alkyne functional 
groups (3) (Fig. 2B). These modifications did 


Design Objective 


Spatially precise 
microenvironment 
labeling 


w 


Resolution A—-O —_ 
determined by J 
reactive intermediate T/2 = 0.1 ms 


half life (74/2) 


protein on cell surface 


not negatively affect its ability to sensitize Nz 
elimination from 1 (supplementary materials). 
Diazirine sensitization could be extended to a 
variety of p- and m-substituted aryltrifluoro- 
methyl] diazirines bearing valuable payloads for 
microscopy and proteomics applications, in- 
cluding free carboxylic acid, phenol, amine, 
alkyne, carbohydrate, and biotin groups (fig. S1). 
The extinction coefficient of the photocatalyst 
(2) is five orders of magnitude larger than that 
of the diazirine (1) at the wavelength emitted 
by the blue LEDs used for sensitization (450 nm), 
explaining the absence of a noncatalyzed back- 


Proximity Labeling 


ground reaction (fig. S2). Last, we assigned a 
short-range (Dexter) energy transfer mecha- 
nism rather than a longer-range Forster en- 
ergy transfer mechanism on the basis of a 
lack of overlap between the absorption band 
of diazirine 1 and the emission band of irid- 
ium catalyst 2 even at high concentrations 
of 1 (0.1 M) (fig. S3). Energy transfer was high- 
ly efficient, with a rate constant of 7.9(5) x 
10’ M's"! (measured through Stern-Volmer 
analysis; number in parentheses indicates 
standard deviation in trailing digit) (table S1 
and figs. S4 and S5). 
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Fig. 1. High-resolution proximity-based labeling by using carbene intermediates. Spatially precise labeling enables the construction of information-rich 
interaction networks. (Top) The resolution of proximity-based labeling is fundamentally limited by the long solution half-life (7;/2) of the reactive intermediates, such 
as phenoxyl radicals, used to label protein constituents of cell-membrane microenvironments. POI, Protein of interest. (Bottom) Using shorter-lived carbenes as 
reactive intermediates bypasses this limitation, enabling new applications in chemical biology. 
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We sought to demonstrate that carbenes | solution of bovine serum albumin (BSA) (10uM) | of BSA was detected through Western blot. 
generated through photocatalytic diazirine sen- | and a biotinylated diazirine probe 4 (100 uM) | When irradiated with lower-energy visible light 
sitization could label proteins (Fig. 2C). When a | were irradiated with 375-nm light, biotinylation | at 450 nm, the degree of biotinylation was <0.5% 
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A Selective Antibody-Targeted Labeling of Bead-Adsorbed Proteins B CD45 Targeted uMapping on Live Jurkat T-Cells 
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Fig. 3. »Mapping on live cells. (A) Western blot analysis of antibody-targeted on Jurkat cells by using peroxidase does not resolve CD45 and known interactors 
uMapping of VEGFR2-Fc or EGFR-Fc on agarose beads shows spatially selective (red dots) from other proteins, including CD29 and CD47. (E) uMapping the 


protein labeling of VEGFR2-Fc or EGFR-Fc. Error bars in barplot are the standard PD-L1 microenvironment on JY PD-L1 B cells to reveal putative interactors. 
deviation calculated from three independent replicates (n = 3). (B) Western blot — Significantly (FDR-corrected P < 0.05) enriched membrane proteins are high- 
analysis of antibody-targeted uMapping of CD45 on Jurkat T cells. (€) Selective lighted in orange, and nonmembrane proteins are in gray. Labels show PD-L1 


proximity labeling of the CD45, CD47, and CD29 microenvironments using uMap. (orange), CD300A (green), and CD30 (purple) proteins. Venn diagram analysis of 
Significantly [false discovery rate (FDR)-corrected P < 0.05] enriched membrane — enriched membrane proteins shows overlap of PD-L1, CD300A, CD30, and nine 
proteins in volcano plots are highlighted in red (CD45), gold (CD47), or blue other proteins. All volcano plots show averaged log2 ratios (targeted protein 
(CD29), and nonmembrane proteins are in gray. Venn diagram analysis of highly versus isotype) on the x axis (n = 3 replicates) and negative logl0 transformed 
enriched membrane proteins shows minimal overlap. (D) CD45 proximity labeling P values on the y axis. (F) String analysis of convergently enriched proteins. 
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A Trans or Cis Labeling with Map via Intra/Extrasynaptic Targeting 
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Fig. 4. Intra- and extrasynaptic pmapping within a two-cell system. 

(A) Schematic depicting Jurkat (CD45RO*/PD-1*/CD3*) and JY (PD-L1*/CD19*) 
two-cell system enhanced by the presence of staphylococcal enterotoxin D (SED) 
for antibody targeting of intrasynaptic PD-L1 (expressed on JY cells) and 
extrasynaptic CD45RO (distinctly expressed on Jurkat cells) for selective trans or 
cis labeling. (B) Flow cytometry analysis of antibody targeting of PD-L1 on JY 
PD-L1 B cells with wMap by using 10-min light irradiation (left) shows both cis- 
and trans-cellular labeling. Peroxidase-based targeted labeling for 0.5 min (right) 
results in nearly complete cis- and trans-cellular labeling. (©) Flow cytometry 
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analysis of antibody targeting of CD45RO on Jurkat T cells with uMap by using 
10-min light irradiation (left) shows only cis-cellular labeling. Peroxidase-based 
targeted labeling for 0.5 min (right) results in nearly complete cis- and trans- 
cellular labeling. (D) Two-cell system confocal microscopy images of isotype- 
targeted (10-min light irradiation) or PD-Ll-targeted (0.5-, 2-, or 10-min light 
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(0.5 or 1 min) by using peroxidase-based labeling (right). Cells were imaged for 


biotinylation (green), CD3 surface expression (magenta), and nuclei (blue). Scale 
bar, 5 um. Duplicate images are shown below each condition. 
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of the level observed through UV irradiation, es- 
tablishing that the diazirine presents minimal 
background signal at this wavelength. However, 
in the presence of water-soluble iridium catalyst 
3 (1M), catalyst-dependent biotinylation of BSA 
was observed. Photocatalytic labeling of BSA 
was further confirmed through intact protein 
mass spectrometry (fig. S6). Unlike other enzyme- 
based labeling methodologies, this approach 
requires continuous delivery of visible light to 
sustain diazirine sensitization for protein label- 
ing. Accordingly, we exploited this feature to 
demonstrate how turning the light source on 
or off affords fine temporal control over the 
labeling process (Fig. 2C, right). 

With an efficient photocatalytic system for 
carbene-based protein labeling in hand, we 
prepared a secondary antibody-photocatalyst 
conjugate as a general entry point for spatially 
targeted photocatalytic proximity labeling on 
cell surfaces. A goat anti-mouse (Gt/a-Ms) anti- 
body was first decorated with azide groups 
through reaction with azidobutyric acid N- 
hydroxysuccinimide ester and then conjugated 
to alkyne-bearing iridium catalyst 3 by means 
of click chemistry, resulting in an antibody- 
photocatalyst ratio of 1:6. Next, to address protein- 
targeted labeling on a surface, we prepared a 
model system containing human Fc-tagged 
vascular endothelial growth factor receptor 2 
(VEGFR2) and epidermal growth factor recep- 
tor (EGFR) proteins attached to a-human im- 
munoglobulin G (IgG) agarose beads (Fig. 3A). 
These beads were sequentially incubated with 
a Ms/a-VEGFR2 antibody and Ir-Gt/o-Ms to 
position the iridium catalyst close to the VEGFR2 
proteins on the bead surface. Irradiation of these 
beads with 450-nm light in the presence of a 
diazirine-biotin probe afforded selective labeling 
of VEGFR2 over EGFR. When Ms/o-EGFR was 
used as the primary antibody, the selectivity 
of labeling was reversed. An analogous ex- 
periment, using peroxidase-based labeling, was 
incapable of differentiating between EGFR or 
VEGFR2 (fig. S7). 


Microenvironment mapping on cell membranes 


We next applied antibody-targeted photo- 
catalytic diazirine activation (uMap) to the 
surface of live cells. For these experiments, 
addition of the antibody to the cell surface 
was maintained at 4°C to limit antibody- 
mediated protein cross-linking (fig. S8) (29, 30). 
We selected CD45, a highly abundant tyrosine 
phosphatase on T cell surfaces involved in 
antigen receptor signaling (37), as an initial 
target. Western blot analysis of CD45-targeted 
uMap on Jurkat cells showed light- and time- 
dependent protein biotinylation compared with 
the isotype-targeting control (Fig. 3B). Next, we 
used tandem mass tag (TMT)-based quantita- 
tive proteomic analysis of streptavidin-enriched 
proteins to identify CD45 and two known as- 
sociators (CD45AP and CD2) through STRING 
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analysis as part of a wider subset of enriched 
cell membrane proteins (Fig. 3C) (32). 

With proof of concept for cell surface label- 
ing in hand, we next questioned whether nuMap 
could differentiate between spatially separated 
microenvironments on the same cell mem- 
brane. To this end, we selected CD29 and 
CD47 as ideal targets with, to the best of our 
knowledge, no known co-spatial association 
on the cell surface. uMapping of CD45, CD29, or 
CD47 on Jurkat cells resulted in the enrichment 
of distinct sets of proteins, which included both 
known (CD29:CD49D, CD45:CD45AP:CD2) and 
previously unknown interactors (Fig. 3C). Cru- 
cially, although several proteins were shared 
between pairs of targeted proteins, none were 
shared across all three, validating the ability 
of uMap to discriminate between unrelated 
microenvironments. By contrast, when using 
state-of-the-art peroxidase-based proximity label- 
ing methods (10, 33), cell surface CD45 and 
associated proteins were not selectively resolved 
from CD29 or CD47 (Fig. 3D and fig. S9). 

Next, we harnessed the resolution and selec- 
tivity of uMap to investigate the proximal protein 
interactome of programmed-death ligand 1 (PD- 
L}) in B cells. As has been well established, PD-L1 
plays an important role in cancer cells as an im- 
mune checkpoint ligand that can accelerate 
tumor progression through suppression of T cell 
activity (34). In the event, PD-L1-targeted uMap 
revealed CD30, a member of the tumor necrosis 
factor receptor family (35), and CD300A, an im- 
mune inhibitory receptor (Fig. 3E) (36), as po- 
tentially new interactors based on significant 
enrichment. These results highlight the potential 
of pMapping to provide new insights with respect 
to the microenvironments of checkpoint proteins. 

To further validate the enriched subset of 
proteins identified with PD-L1 uMapping, we 
performed targeted labeling of these two highly 
enriched proteins. Targeted uMapping of these 
proteins within the PD-L1 microenvironment 
should, therefore, afford similar enrichment 
lists, verifying their spatial association. We 
found that o-CD30-, o-CD300A-, and o-PD- 
Li-directed uMapping identified the same set 
of 12 surface receptors (Fig. 3F). 

It is well recognized that the development of 
new therapeutic oncology strategies will require 
an understanding of the underlying mechanisms 
of intercellular communication, particularly 
within the context of T cell activation and differ- 
entiation. Furthermore, given that localization 
of PD-L1 is found within the T cell/antigen- 
presenting cell (APC) immunosynapse (at the 
interface between two immunointeractive cells), 
we hypothesized that PD-Li-directed uMapping 
should lead not only to biotinylation of a PD-LI- 
expressing APC surface (cis-labeling) but also 
to the biotinylation of the adjacent synaptic 
T cell (trans-labeling) (Fig. 4A). As a control ex- 
periment, we further posited that when target- 
ing a protein excluded from the synapse, such 
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as CD45RO (37, 38), the diffusion-minimized 
radius of uMap would preclude biotinylation 
of the distant trans-cell membrane. 

We evaluated PD-LI- and CD45-targeted 
uMapping in a two-cell system composed of 
PD-Ll-expressing JY B-lymphocytes as the 
APC and Jurkat T-lymphocytes distinctly ex- 
pressing PD-1 and the CD45RO isoform (fig. 
S10). Using staphylococcal enterotoxin D (SED) 
to facilitate T cell receptor (TCR)-major histo- 
compatibility complex (MHC) engagement and 
promote B cell/T cell immune synapse forma- 
tion and signaling, we assessed the ability of 
uMap to selectively label within intercellular 
synapses (Fig. 4A and fig. S11) (39-41). Flow 
cytometry analysis was then used to monitor 
the extent of cis- and trans-cellular labeling. 
As anticipated, PD-L1-targeted uMap resulted 
in both cis- and trans-cellular labeling, where- 
as CD45RO-targeted uMap led to selective cis- 
labeling on the CD45RO-expressing Jurkat cells 
(Fig. 4, B and C, and fig. S12), without any 
labeling of the adjacent B cell. In stark contrast 
to uMap, peroxidase-based proximity labeling 
of PD-L1 or CD45RO within this two-cell sys- 
tem led to complete labeling of both cell types 
within 30 s, clearly visualized through flow 
cytometry and confocal microscopy (Fig. 4, 
B, C, and D; and fig. S12). In comparison, PD- 
Li-targeted uMap showed high selectivity for 
trans-labeling solely at the cis- and trans-cellular 
contact regions (Fig. 4D and fig. $12). Applying 
the uMap technology on PD-1 in the Jurkat-JY 
coculture system resulted in the reciprocal trend 
of cis- and trans-cellular labeling (fig. S13). Col- 
lectively, these findings clearly demonstrate that 
the capacity of uMap to elucidate protein-protein 
interactions can be directly translated toward 
the highly selective labeling of dynamic inter- 
faces within complex multicellular systems. 

We expect that this technology will find 
immediate use in antibody target identifica- 
tion, exploration of signal transduction path- 
ways, profiling cell-cell junctions, and other 
disease-relevant microenvironments. 
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SOLAR CELLS 


Triple-halide wide-band gap perovskites with 
suppressed phase segregation for efficient tandems 
Jixian Xu??>*+, Caleb C. Boyd”*+, Zhengshan J. Yu°, Axel F. Palmstrom?, Daniel J. Witter’, 


Bryon W. Larson’, Ryan M. France’, Jérémie Werner*2, Steven P. Harvey’, Eli J. Wolf, 
William Weigand®, Salman Manzoor®, Maikel F. A. M. van Hest’, Joseph J. Berry”, Joseph M. Luther’, 


Zachary C. Holman®, Michael D. McGehee'2:>* 


Wide-band gap metal halide perovskites are promising semiconductors to pair with silicon in tandem 
solar cells to pursue the goal of achieving power conversion efficiency (PCE) greater than 30% at low 
cost. However, wide-band gap perovskite solar cells have been fundamentally limited by photoinduced 
phase segregation and low open-circuit voltage. We report efficient 1.67—electron volt wide-band 

gap perovskite top cells using triple-halide alloys (chlorine, bromine, iodine) to tailor the band gap 
and stabilize the semiconductor under illumination. We show a factor of 2 increase in photocarrier 
lifetime and charge-carrier mobility that resulted from enhancing the solubility of chlorine by replacing 
some of the iodine with bromine to shrink the lattice parameter. We observed a suppression of 
light-induced phase segregation in films even at 100-sun illumination intensity and less than 4% 
degradation in semitransparent top cells after 1000 hours of maximum power point (MPP) operation 
at 60°C. By integrating these top cells with silicon bottom cells, we achieved a PCE of 27% in 
two-terminal monolithic tandems with an area of 1 square centimeter. 


ompositional tuning can yield opto- 

electronic properties of state-of-the-art 

ABXz metal halide perovskites that are 

desirable for use in highly efficient solar 

cells, light-emitting diodes (LEDs), and 
detectors. Such tuning has been achieved 
through mixing of cations such as methylam- 
monium (MA), formamidinium (FA), cesium 
(Cs), dimethylammonium (DMA), and meth- 
ylenediammonium (MDA) at the A site (7-6); 
lead (Pb) and tin (Sn) at the B site (7, 8); and 
iodine (I) and bromine (Br) at the X site (9, 10). 
Wide-band gap alloys (>1.65 eV) in particular 
are of considerable interest for both LEDs and 
multijunction solar cells (77-13). The latter can 
overcome the Shockley-Queisser limit of single- 
junction cells by stacking complementary 
wide-band gap and narrow-band gap absorb- 
ers to reduce thermalization losses (71, 14). As 
market-dominant silicon solar cells approach 
their theoretical limit of 29.1% with a lab record 
of 26.7% (15, 16), the development of multi- 
junction solar cells or tandem solar cells could 
help to further increase the power conversion 
efficiency (PCE) and reduce the levelized cost 
of photovoltaic (PV) electricity (17, 18). 
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Although a range of band gaps from 1.6 to 
3.06 eV can be achieved through I/Br and Br/Cl 
alloying (9, 10, 19-21), wide-band gap alloys in 
the optimum range for use in tandem solar 
cells suffer from short diffusion lengths and 
photoinduced phase segregation, which lead to 
poor optoelectronic qualities and substantial 
open-circuit voltage (V,,) deficits relative to 
their theoretical limit (72, 22, 23). Studies of 
this unexpectedly large V,, deficit revealed 
photoinduced trap formation by halide segre- 
gation, especially when the Br fraction on the 
X-site was >20% (12, 22, 23). The alloyed I/Br 
phases segregated upon illumination and photo- 
carriers were funneled into the low-band gap 
I-rich domains, which acted as traps that red- 
shift the photoluminescence and reduce the V,. 
(23). Reduced photoinduced phase segregation 
and improved material quality of wide-band 
gap perovskites through cation substitution of 
Cs or DMA achieve wide band gaps while limit- 
ing the Br fraction (2, 24, 25). Additive engi- 
neering can reduce defect densities (26-30). 
Surface engineering techniques such as the 
formation of 2D/3D heterostructures have also 
been used for improving device V,, (37). How- 
ever, wide-band gap solar cells still are in need 
of both a reduced V,, deficit and improved 
photostability (73). 

Inspired by recent advances in achieving im- 
proved material quality with triple- or quadruple- 
cation perovskites (/, 6) and reports suggesting 
that Cl as an additive may reduce defect density 
(32-35), we explored the triple-halide (I, Br, Cl) 
compositional space. Generally, previous stud- 
ies have reported the use of Cl as an additive 
or precursor to affect morphology and surface 
passivation but with little or no incorporation 
into the bulk material. The Cl also had little 


impact on the band gap, independent of the Cl 
ratio included in the precursor (33-40). The 
lack of incorporation into the bulk lattice with 
little change in the band gap occurs because Cl 
typically volatilizes as MAC] or FAC] during 
annealing of the perovskite film and only acts 
to control film crystallization, as it resides at 
grain boundaries or on perovskite film surfaces 
in the final material. 

We report that Cs and Br act as a bridge to 
a new phase space in wide-band gap perov- 
skites of triple-halide alloys containing I, Br, 
and Cl at molar amounts >15%. We directly 
incorporated Cl into the lattice at much larger 
amounts than previously reported, observing 
a uniform halide distribution throughout the 
material with a reduction in lattice parameter 
and an increase in band gap corresponding to 
increasing amounts of Cl in the lattice. By 
extending the double-halide to triple-halide 
alloy, we observed a distinct enhancement in 
optoelectronic characteristics, such as factor 
of 2 increases in photocarrier lifetime and 
mobility and the suppression of light-induced 
phase segregation at intensities up to 100 suns. 
These material advances reduced the V,, deficit 
of 1.67-eV wide-band gap solar cells by 100 mV 
and boosted PCE from ~18% to 20.3% (Table 
1 and table S1). We demonstrated large-area 
(1cm?), wide-band gap semitransparent top 
cells for Si tandems with a high certified per- 
formance of 16.83% that resulted from a sub- 
stantially improved V,, relative to prior reports 
(Table 1 and table S2). The triple-halide mate- 
rial was also operationally stable in solar cells 
that maintained >96% of their initial efficiency 
after 1000 hours of maximum power tracking 
under white light illumination at 60°C, and 
>97% of their initial efficiency after 500 hours 
at 85°C. By integrating triple-halide semi- 
transparent top cells with Si bottom cells with 
PCE of ~21% (41), we boosted their PCE by 
30%, to 27% in 1-cm? two-terminal monolithic 
tandems (Table 1 and table S3). 


Alloying chlorine into the perovskite lattice 


We started with a targeted perovskite band 
gap of ~1.67 eV, which is in the range of ideal 
top-cell band gaps for a perovskite/Si tandem, 
taking into account imperfect absorption near 
the band edge (42, 43). FAg.75CSo.25Pb(1o. sBro.2)3 
has a band gap of 1.67 eV and was recently 
demonstrated to enable 25% PCE in two- 
terminal perovskite/Si tandems (25). We denote 
this perovskite family as CsFA perovskite, and 
denote this composition as Cs25Br20 in ac- 
cordance with the percentage of Cs at the A 
site and the percentage of Br at the X site. 
Cs25Br20 outperformed its 1.67-eV counter- 
part Cs17Br25 in light stability and V,, deficit, 
attributed to the reduced Br fraction and in- 
creased Cs fraction. To further improve the 
photostability, we reduced the Br fraction to 
15% and found that Cs25Br15 has a band gap 
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of ~1.63 eV. To recover the optimal 1.67-eV band 
gap, we attempted to increase the Cs fraction 
but obtained lower PV performance (25), con- 
sistent with recent reports that a high Cs frac- 
tion might lead to halide inhomogeneity and 
phase segregation (4, 44). 

We sought to raise the band gap by alloying 
MAPDCl; perovskite into the lattice of CsFA 
perovskites. We chose MAPbCl, perovskite 
because it has a wide band gap (2.88 eV) and 
higher stability than its Br and I counterparts 
(45, 46). When we mixed Cs25Br15 with 2 to 
5 mol % MAPDCIl; (referred to hereafter as 
Cs25Br15+Cl2 and Cs25Br15+Cl5) in solution, 
the band gap of the resulting films could be 
continuously raised from 1.63 to >1.67 eV 
(Fig. 1A and fig. S1). The sharp onset of the 
external quantum efficiency (EQE) indicated 
that a photoactive triple-halide perovskite phase 
had formed. 

We used x-ray diffraction (XRD) to verify the 
triple-halide phase and determine the lattice 
constant. As indicated by the shift of the dif- 
fraction peaks (figs. S2 and S3), the perovskite 
single phase was retained and the lattice con- 
stant continually decreased with increasing 
MAPDbCl, fraction. Specifically, Cs25Br15+Cl5 
has a lattice constant of ~6.24 A (Fig. IF), which 
is similar to that of Cs25Br20 and evidently 
less than that of the host perovskite Cs25Br15, 
6.27 A. 

Incorporating Cl into perovskite precursor 
solutions has been widely reported to increase 
the size of apparent grain domains in solid 
films, because the formation and outgassing of 
MACI modify the nucleation and crystal growth 
dynamics (47-51). Prior reports showed that the 
band gap remained substantially unchanged 
and that the Cl concentrations in the solid 
films were below the detection limit of energy- 
dispersive x-ray spectrometry (EDX) and x-ray 
photoelectron spectroscopy (XPS), even though 
a large portion of Cl was present in the pre- 
cursors (33, 34, 36). In contrast, we did not 
observe enlargement of grain domains when 
more Cl was incorporated into the triple-halide 
films, as shown by scanning electron micro- 
scope (SEM) images (Fig. 1B and fig. S4). 

In addition, we could readily detect Cl using 
XPS. The Cl-to-Pb ratio of films also increased 
when a higher fraction of Cl was included in 


the precursors (fig. S5). The Cl in the triple- 
halide films was uniformly distributed through- 
out the entire film thickness, similar to I and Br, 
as shown by time-of-flight secondary ion mass 
spectrometry (TOF-SIMS) depth profiling (Fig. 
1C). To mitigate measurement artifacts (52), 
we recorded secondary ion clusters or molec- 
ular ion signals of Cs,Cl*, Cs4I;*, and CsBr* to 
track the depth profile of halides. The relative 
scaling of SIMS profiles between halides did 
not represent their stoichiometric ratio (53). 

Taken together, the EQE, XRD, SEM, XPS, 
and SIMS observations show that Cl was 
incorporated into the perovskite lattice and 
increased its band gap, rather than being 
sacrificed as a volatile phase. Because the lattice 
constant of the host perovskites was reduced 
by the Cs and Br, Cl was closer to the ideal 
size for the X-site in the lattice. This expla- 
nation is consistent with the formation of 
MAPbBr/Cl perovskites being more thermo- 
dynamically preferred than MAPbI/Cl double- 
halide alloys (19-2D). 

Because perovskites with even wider band 
gaps are needed for all-perovskite tandems 
and possibly other applications, we identified 
the phase boundary of the triple-halide alloy- 
ing strategy and obtained additional mecha- 
nistic insights into the perovskite phase space. 
When we further increased the molar ratio 
of Cl/(I+Br+Cl) to above 10% in Cs25Br15, we 
observed an inflection point in the band gap 
evolution curve (Fig. 1D, red line). That is, the 
band gap of the films began to decrease with 
increased Cl content, as shown in the EQE 
spectra (fig. S1). This increasing and then de- 
creasing of the band gap with triple-halide 
alloying is anomalous and distinctly different 
from the monotonic growth trend reported in 
I/Br double-halide alloys (9, 10, 23). 

Structural evolution observed with XRD (Fig. 
1, Eand F, and fig. $2) revealed that the band 
gap decrease at high Cl content was caused by 
phase segregation into two perovskite phases. 
At the phase boundary, the diffraction peaks of 
a segregated phase (very likely MAPbC1,Brs3_,,) 
emerged (fig. S2, A and D). The lattice of the 
host perovskite then expanded (diffraction 
peaks shifted to lower angles) with the growth 
of the segregated perovskite phase (Fig. 1F, red 
line, and fig. S2D), consistent with the ob- 


served band gap decrease and suggesting the 
segregation of a high-Br/Cl phase, which left 
behind an I-rich, lower-band gap host perov- 
skite (Fig. 1D and fig. S1). 

To further validate the generality of the 
triple-halide alloy, we also used Cs25Br30 and 
Cs25Br40 as the host perovskites to investigate 
the evolution of band gap (see EQE data in 
figs. S6 and S7) and lattice constant (see XRD 
data in figs. S8 and S9) when increasing the 
triple-halide alloy ratio. The trends in the band 
gap (Fig. 1D) and lattice constant (Fig. 1F) were 
consistent, further substantiating this mecha- 
nism of triple-halide alloying. In all cases, the 
band gap-raising rate was ~0.0078 eV, with each 
percent of Cl alloyed to form the single phase. 
The inflection point of band gap evolution (or 
the phase boundary between the single-phase 
triple-halide alloy and phase segregation) con- 
sistently increased with increased Br content, 
as in Cs25Br15, Cs25Br30, and Cs25Br40, indi- 
cating a wider single-phase range and greater 
tolerance to phase segregation in the perovskite 
with higher Br content. This finding verified 
that Br reduced the alloy barrier between I and 
Cl phases for triple-halide alloys. The addition 
of PbCl, in Cs25Br15 also increased the band 
gap in a manner similar to the effect of adding 
MAPDbCl, (fig. S10), thereby reconfirming that 
Cl incorporation is the determinant to the band 
gap raising in triple-halide alloys. Adding MAC] 
alone did not raise the band gap (fig. S10A), in 
agreement with previous reports that MAC] 
volatilizes during film annealing (47-51). 


Improved optoelectronic properties of 
triple-halide perovskites 


Recent studies showcase the improvements 
in optoelectronic properties that result from 
incorporating even small amounts of Cl (<1%) 
(32-34, 47, 50). For example, small Cl ions 
might reduce the halogen vacancy defect den- 
sity and improve the stability of perovskites in 
ambient air (32). Given that in our triple-halide 
perovskites, Cl-based reagents are well incor- 
porated into the lattice, we used time-resolved 
microwave conductivity (TRMC) to study their 
electronic properties and charge-carrier dyna- 
mics (54). Despite no increase in apparent grain 
size in the triple-halide perovskites (Fig. 1B), 
an enhancement in both the charge-carrier 


er] 
Table 1. PV parameters of solar cells using 1.67-eV triple-halide wide-band gap perovskites. Area is defined by optical aperture placed in front 


of devices. 
Cell configuration Area (cm?) Voc (V) Je¢ (mA cm-) FF (%) PCE (%) Stabilized PCE (%) 
Opaque single-junction 0.06 ly) 20.18 83.16 20.32 


Two-terminal tandem on Si 1 


*11-point stabilized power output (SPO) measurement. 
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mobility and lifetime by nearly a factor of 2, 
equivalent to a doubled carrier diffusion length, 
was seen in comparison to our best 1.67-eV con- 
trols. Specifically, when the absorbed photon 
flux was ~1 x 10'° cm (near I-sun intensity), 
the lifetime increased from 420 to 846 ns (Fig. 
2A) and the charge-carrier mobility improved 
from 29 to 53 cm? V's" (Fig. 2B). 

To disentangle the effect of Cl incorporation 
on the bulk and surface, we applied a passi- 
vating layer on top of the perovskite to reduce 
surface recombination. We chose LiF because 
it is insulating and suppresses nonradiative 
surface recombination in perovskites, thereby 
increasing the carrier effective lifetime (55). In 
TRMC measurements, LiF-capped perovskite 
samples exhibited the same mobility and life- 
time as bare perovskite samples (Fig. 2), which 
confirmed that the enhancement in photo- 
carrier transport in triple-halide perovskites is 
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Fig. 1. Characteristics of triple-halide perovskite alloys. (A) EQE measure- 
ments show the band gap raising via the formation of triple-halide perovskites with 
2 to 5 mol % Cl relative to host perovskites. Cs25Br15 and Cs25Br20 denote the 
double-halide perovskites Cso,25F Ao75Pb(lo.gsBro.1s)3 and Cso,25F Ao 75Pb(lo.gBro.2)3, 
respectively. Cs25Br15+Cl2 and Cs25Br15+Cl5 denote the triple-halide perovskites 
with an additional 2 or 5 mol % MAPbCls3 included in the Cs25Br15, respectively. 

(B) Top-view SEM images show no evident difference of apparent grain size between 
triple-halide perovskites and double-halide control films. (©) TOF-SIMS depth 
profiles show the uniform distribution of halides throughout the entire film thickness of 
the triple-halide perovskite (Cs25Br15+C15). Here, the Cs trace is not included because 
a Cs ion beam was used for sputtering to increase halide signal intensities, which 
makes the Cs profile through the device stack difficult to interpret. The secondary ion 
clusters or molecular ion signals of Cs2Cl*, Csql3", and Cs2Br* were recorded to 
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dominated by the bulk properties rather than 
surfaces. Given that there was no increase of 
grain size, we assert that the harmful defect 
density for carriers is greatly reduced by Cl 
incorporation in triple-halide films. Dark micro- 
wave conductivity measurements showed a 
factor of 2 reduction (from 7.8 x 10° cm™ to 
2.9 x 10° em™®) in dark carrier density (fig. 
S11), confirming the reduced defect density 
in triple-halide perovskites over the control. 
Taking into account both the potential con- 
tribution of defects to photoinstability (26-30) 
and the reduced Br ratio in triple-halide perov- 
skites, we expected that the photoinduced phase 
segregation would also be reduced. 

To compare the susceptibility of triple-halide 
films and our best Cs25Br20 controls to light- 
induced phase segregation, we carried out time- 
and intensity-dependent photoluminescence 
(PL) measurements under 488-nm continuous- 
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wave laser illumination in a Nj environment. 
A red shift or a lower-energy peak forming in 
the PL spectrum indicates the formation of 
light-induced low-band gap I-rich trap states, 
as seen in previous reports (23, 25, 28, 30, 56). 
Here, to avoid noise and error in peak position, 
we used the spectral centroid to quantitatively 
indicate the evolution of the population span- 
ning the entire PL spectral range. During the 
course of 20 min at 10-sun-equivalent illumi- 
nation, Cs25Br20 controls exhibited low-energy 
PL peaks and increasing peak width (Fig. 3A). 
In contrast, 1.67-eV triple-halide films showed 
no low-energy peak formation and retained 
their PL spectral profile, highlighting their 
superior photostability (Fig. 3D). At an ultra- 
high injection level of 100 suns, the red shift 
and broadening of the PL peak became more 
apparent in Cs25Br20 controls (Fig. 3B), con- 
sistent with previous reports. Surprisingly, we 
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ichiometric ratio. ETL (electron transport layer) is a 
stack of Ceo/BCP layers. (D) Evolution curves of band gap for triple-halide films with 
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ase triple-halide alloy to double-phase segregation 


with increased Cl content. Here, the host perovskite is Cs25Br40. The intensity of XRD 
peaks is rescaled for ease of peak position comparison. (F) Evolution curves of 

host perovskite lattice constant with increasing ratios of Cl/(I+Br+Cl). The dashed line 
delineates the single-phase and double-phase zones corresponding to band gap 
raising and band gap reducing, respectively. 
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observed a PL blue shift, rather than a red 
shift, in triple-halide films caused by the emis- 
sion growth at the PL high-energy shoulder 
(Fig. 3E). This opposite trend was also ob- 
served in the traces of the spectral centroid over 
time (Fig. 3, C and F). 

This PL blue shift of wide-band gap perov- 
skites at a high injection level is unusual and 
strongly suggests that incorporation of Cl into 
the lattice had an impact on the optoelectronic 
properties and halide phase-segregation path- 
ways. Our TRMC and PL data suggested a com- 
bination of mechanisms: (i) the Moss-Burstein 
effect in the phase-stabilized semiconductors, 
which originates from the lifting of the quasi- 
Fermi level when the conduction band edge is 
fully populated at a high injection level (57, 58), 
and (ii) a decrease in defect concentration. 


Efficient and stable triple-halide 
perovskite/silicon tandems 


To evaluate the effect of doubled charge- 
carrier diffusion length and better photo- 
stability in triple-halide films, we fabricated 
1.67-eV single-junction solar cells with opaque 
metal contacts. We used a p-i-n structure (Fig. 
4A): glass / indium tin oxide (ITO) / poly(4- 
butylphenyl-diphenyl-amine) (poly-TPD) / 
poly(9,9-bis(3’-(N,N-dimethyl)-N-ethylammonium- 
propyl-2,7-fluorene)-alt-2,7-(9,9-dioctylfluorene)) 
dibromide (PFN-Br) / perovskite / LiF / Cgo / 
bathocuproine (BCP) / Ag. The statistical com- 
parison of devices (fig. S12) showed that triple- 
halide devices performed better than our best 
Cs25Br20 controls for all PV metrics. The best- 
performing triple-halide device with composi- 
tion Cs22Br15+Cl3 (Fig. 4B) exhibited a PCE of 
20.42% and V,, of 1.217 V Lee = 20.18 mA cm, 
fill factor (FF) = 83.16%, active area = 0.06 cm?], 
whereas the best-performing control showed 
a PCE of 18.16% and V,, of 1.114 V. Both the 
control and triple-halide devices exhibited no 
hysteresis in J-V sweeps. By integrating the 
EQE spectrum, weighted by the AM1.5G solar 
spectrum (Fig. 4G), we calculated a current 
density of 20.6 mA cm”, in good agreement 
with the J,. obtained in the J-V test. The 
stabilized efficiency was 20.32% after con- 
tinuous operation at MPP for 5000 s (Fig. 4C). 
Table S1 summarizes reported p-i-n wide- 
band gap cells for tandems (2, 24, 47, 54, 59, 60). 
The triple-halide devices considerably improved 
the V,. deficit, which was reduced from >0.5 V 
to ~0.45 V, as well as the PCE, which increased 
to >20%. 

To construct tandems on top of Si cells, we 
developed top-illuminated semitransparent de- 
vices that maximized both the near-infrared 
transparency and PV performance. The device 
structure is illustrated in Fig. 4D. The metal 
top contact (Ag or Au) in opaque cells was re- 
placed with an 8-nm-thick SnO, / Zn:SnO, 
(ZTO) buffer layer (2, 41, 54, 61, 62) coated by 
atomic-layer deposition (ALD) and an ITO win- 
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Fig. 2. Enhanced charge-carrier mobility and lifetime in triple-halide perovskite films. (A) Photoconductivity 
transient (indicative of the lifetime) measured by time-resolved microwave conductivity (TRMC), indicating 
doubled charge-carrier lifetime (t) in the 1.67-eV triple-halide film (Cs22Br15+Cl3, red line) compared with the 
control film (Cs25Br20, blue line). (B) Photoconductivity under different excitation intensity. eS “u denotes the yield- 
mobility product. The carrier-generation yield « is near unity for perovskite films and therefore ~)—u is a measure of 
mobility. Triple-halide films exhibit nearly doubled mobility relative to control double-halide films. Perovskite/LiF 
bilayer samples exhibit mobilities and lifetimes similar to their corresponding bare perovskite samples. 


dow layer to allow light through. Parasitic ab- 
sorption and reflection loss created by layers in 
the top electrode were minimized (fig. S13) by 
using a polydimethylsiloxane (PDMS) scatter- 
ing layer to limit reflection (67) and by reducing 
the thickness of the Cg and ITO layers to 10 
and 60 nm, respectively (fig. S14). 

To reduce the resistance of large-area devi- 
ces, we evaporated a metal grid on top of the 
ITO window layer. Note that in top-illuminated 
devices, J,. is typically lower than the EQE 
integrated current density because of the shading 
losses created by the metal grid. To minimize 
this loss, we used a metal shadow mask with a 
graded opening to reduce the shadow effect that 
occurs when metals are evaporated through 
high-aspect ratio holes in a mask (fig. S15). The 
graded-opening mask enabled us to construct 
~25-um-wide metal gridlines with a height of 
~3 um using thermal evaporation (fig. S16), 
which reduced the shading loss to below 0.8% 
without compromising FF. 

With these improvements, our 1.67-eV top- 
illuminated semitransparent cells simultane- 
ously achieved ~80% near-infrared transmittance 
(Fig. 4G) and PCEs of 18.59% (Fig. 4E). The cor- 
responding stabilized PCE was 18.52% in the 
MPP tracking test (Fig. 4E, inset). The V,, of 
1.214 V and FF of 80.2% in 0.34-cm? semi- 
transparent devices were near those achieved 
in the best 0.06-cm? opaque devices with a 
metallic electrode (Fig. 4B and Table 1), high- 
lighting the quality and stability of triple-halide 
perovskites during ALD buffer layer deposition 
at 85°C and subsequent ITO sputter deposition. 
The active-area J,. determined from the EQE 
spectrum (Fig. 4G) was ~19.5 mA cm”, which 
agrees well with the aperture-area J,. of 
19.13 mA cm”? from J-V tests. The loss of Jg¢ in 
semitransparent top cells relative to opaque de- 


vices primarily resided in the long-wavelength 
region around the band edge and arose from 
the lack of reflection from the metal contact. 

We also fabricated large-area (1 cm”) semi- 
transparent top cells. One such device was mea- 
sured by an accredited PV laboratory (National 
Renewable Energy Laboratory PV Device Per- 
formance) and certified at a stabilized PCE of 
16.83 + 0.34% in an 11-point SPO (stabilized 
power output) test. This value appears to be 
the highest certified PCE reported for a 1-cm? 
semitransparent perovskite cell for tandems 
(table S2 and figs. S17 and S18). The perform- 
ance metrics of the J-V curve were V,, = 1.202 V, 
Jgc = 18.99 mA cm”, FF = 74.07%, and PCE = 
16.90% (Fig. 4F and Table 1). The nearly identical 
PCEs measured in a J-V test and SPO test 
further verified the negligible hysteresis and 
stability of our triple-halide perovskite devices. 
The primary PCE loss in the large-area (1 cm?) 
devices relative to small-area (0.34 cm?) de- 
vices resided in FF, which is attributed to the 
limited sheet resistance (15 ohms per square) 
of ITO substrates used in this study; this limi- 
tation does not apply to two-terminal tandems 
that transport current vertically through the 
recombination layer between the top and bot- 
tom cells. Our semitransparent top cells ex- 
hibited both the highest PCE and highest V,, 
of reported semitransparent top cells for tandems 
on Si, CIGS (copper indium gallium diselenide), 
or narrow-band gap perovskites (table S2) 
(24, 41, 54, 60, 63-72). 

We examined the long-term operational 
stability of triple-halide devices under the 
combined stresses of heating and illumina- 
tion (~0.77-sun sulfur plasma lamp, chamber 
temperature T = 60°C; see supplementary ma- 
terials for details). An unencapsulated opaque 
device with metal contacts tested in ambient 
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air (relative humidity = 25 to 30%) retained 
90% of its initial PCE after 250 hours of con- 
tinuous operation at MPP (Fig. 4H). The thermal 
stability of the triple-halide material was also 
tested by keeping opaque devices with gold 
contacts at 85°C for 500 hours in nitrogen; the 
unencapsulated triple-halide devices and double- 
halide devices both retained >97% of their initial 
efficiency (fig. S19). Semitransparent top cells 
tested in Nz, exhibited negligible degradation 
(<4%) after 1000 hours of continuous operation 
at MPP near 60°C (Fig. 4H). This result con- 
firmed the importance of encapsulation (No 
in this case) and the role of barrier layers [an 
ALD buffer layer and a transparent conduc- 
tive oxide (TCO) layer in this case] to suppress 
the reactions between metal contacts and 
halides, as revealed in previous reports (73-75). 

We integrated our triple-halide perovskite 
in I-cm? two-terminal monolithic tandems on 
silicon cells (see photos in fig. S20). Here, the 
ITO/glass substrate of the semitransparent 
device (Fig. 4D) was replaced by a front-side 
polished silicon heterojunction bottom cell 


0 Time (min) 


capped with a 20-nm-thick ITO recombination 
layer, which allowed for solution processing of 
the perovskite layer rather than vapor deposi- 
tion that would be required to deposit the 
perovskite on a pyramidally textured silicon 
surface (65, 76). With the aid of proper light 
management, tandems on planar front-side 
wafers have been shown to deliver J,, values 
competitive with those on fully textured wafers 
(77). Additionally, front-side planar substrates 
are industrially relevant, because PERC (passi- 
vated emitter and rear cell) silicon cells, which 
constitute ~70% of the silicon PV market, have 
a planar “shiny-etched” rear surface that was 
recently shown to allow for solution process- 
ing of the perovskite solar cell (78). 

The full tandem structure is illustrated in 
Fig. 5A. Figure 5B shows a cross-sectional SEM 
image of the perovskite top cell on Si. Here, 
we developed a NiO, / poly-TPD bilayer hole 
transport layer (HTL) for use in the tandems. 
Relative to HTLs that used only spin-cast poly- 
TPD, introducing a 20-nm-thick sputtered NiO,, 
layer between the ITO-HTL interface reduced 
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shunting and increased the yield of 1-cm? tan- 
dems. In a J-V sweep, the best-performing tan- 
dem reached a PCE of 27.13% (Vo, = 1.886 V, 
Jee = 19.12 mA cm”, and FF = 75.3%). The 
corresponding stabilized PCE at MPP is 27.04% 
(Fig. 5C and Table 1), exceeding the current 
record efficiency (26.7%) for Si single cells and 
approaching the world record value of 28% 
achieved by Oxford PV (15, 16). The V,, of >1.88 V 
is one of the highest reported in perovskite/Si 
tandems with PCE of >25% (59, 61, 65, 76, 77) 
(see summary of =1 cm?-area tandems in table 
$3), providing further evidence of the low V,. 
deficit achieved in the triple-halide semitrans- 
parent top cells (table S2). The EQE spectrum 
of the perovskite top cell matches well with 
those measured in top-illumination semitrans- 
parent devices (Fig. 5D). The active-area J,. 
values calculated from the EQE spectra were 
19.3 mA cm” and 19.9 mA cm” for the perov- 
skite top cell and Si bottom cell, respectively 
(Fig. 5D). These calculated values indicate that 
the J,, of our two-terminal tandem was slightly 
limited by the perovskite top cell. 
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Fig. 3. Suppression of photoinduced phase segregation in triple-halide perovskites. (A and B) PL spectra of 1.67-eV control perovskite films (Cs25Br20) under 
10-sun and 100-sun illumination for 20 min, respectively. Arrows indicate the direction of the PL shift over time. (€) The shift of the spectral centroids of control films over time. 
The red shift becomes more obvious under higher injection. (D and E) PL spectra of 1.67-eV triple-halide perovskites (Cs22Br15+CI3) under 10-sun and 100-sun illumination 
for 20 min, respectively. (F) The shift of the spectral centroids of triple-halide perovskites over time. The blue shift becomes more obvious under higher injection level. 
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We also analyzed each subcell in our two- 
terminal tandems by measuring electrolumi- 
nescence (EL) (Fig. 5E). After increasing the 
injection current density from 1 mA cm” to 
100 mA cm”, the EL peak from the perovskite 
retained its shape and location, verifying the 
phase stability of triple-halide perovskites (Fig. 
5F). The subcell’s V,. was estimated from EL 
quantum efficiency based on the reciprocity 


principle (Fig. 5G) (79). At an injection level | efficiency (25.8%) is highly consistent with 
of 17 to 20 mA cm” (near Jee and Jmpp), the | that measured in our lab (26%) (fig. S21A), 
respective V,, values of the perovskite and Si | verifying our rigorous protocol of in-house 
subcells were estimated to be 1.2 Vand 0.7V, | tandem measurement. From the comparison 
aligning well with the tandem V,,. of ~1.9 V | of EQEspectra (fig. S21B), the J,, loss relative 
from J-V measurements. to our champion tandem (Fig. 5D) is likely 

A two-terminal tandem device was certified | caused by the absorption from a thicker Cgg 
after being tested in air and stored in Nz for | top layer, as discussed above (figs. S13 and S14). 
1 month (figs. S21 and S22). The certified | Even with this J,, loss, it exhibited a V,, value 
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Fig. 4. PV characteristics of single-junction opaque devices and top metal grid and TCO layer side. The aperture area is 0.34 cm”. The inset shows 
semitransparent top cells. (A) Schematic of the opaque device with reflective continuous MPP operation for 100 s, exhibiting stabilized PCE of 18.52%. (F) A 1-cm? 
metal contact (not to scale). (B) J-V curves of the champion 1.67-eV triple- top-illuminated semitransparent device certified by NREL. The circles are the 11-point 


halide opaque device (red) and best control device (blue). The aperture area is — SPO (stabilized power output) measurements, showing stabilized PCE of 16.83%. 
0.06 cm’. Solid lines indicate the reverse scan (from open circuit to short circuit); TheJ-V curve shows the instantaneous PCE of 16.9%: the inset is a photo of the device. 


dashed lines indicate the forward scan (from short circuit to open circuit). (G) EQE spectra of an opaque device (blue) and a top-illuminated semitransparent 
(C) Continuous maximum-power-point (MPP) operation of the champion device (red). The green curve is the optical transmission of the top-illuminated 
triple-halide wide-band gap device for 5000 s, exhibiting stabilized PCE of semitransparent device, including the metal grid shading losses. (H) Long-term 
20.32%. (D) Schematic of the top-illuminated semitransparent device (not to continuous MPP tracking under accelerated conditions (0.77-sun illumination and 
scale). Light enters the cell through the top metal grid and TCO layer. 60°C) of 1-cm* semitransparent (red, tested in No, averaged from two devices in the 
(E) J-V curves of the champion semitransparent device illuminated from the same batch) and opaque devices (blue, tested in air with relative humidity of ~30%). 
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Fig. 5. PV characteristics of 1-cm* two-terminal perovskite/Si tandems. 

(A) Schematic of the two-terminal monolithic tandem structure (not to scale). 
a-Si:H denotes hydrogenated amorphous silicon. (B) Cross-sectional SEM 
image of a two-terminal tandem. (C) Light and dark J-V curves and MPP tracking 
(inset) of the champion tandem. (D) EQE spectra of the perovskite top cell 
(blue) and silicon bottom cell (red) of the champion tandem. (E) Schematic 


(1.88 V) that is much improved over the best 
prior reports (<1.8 V) (table S3). Finally, the 
statistical data of tandems (fig. S23) indicates 
that our method was reproducible. 


Outlook 


We have demonstrated a low-cost tandem 
technology to increase the power output of a 
~21% silicon solar cell by ~30%, which has 
major implications given that silicon currently 
accounts for >90% of the global PV market, 
which mainly consists of 19 to 21% single- 
junction silicon solar modules. Furthermore, 
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triple-halide wide-band gap perovskites offer 
a promising path to >30% tandems because of 
their reduced V,, deficit and record perform- 
ance in wide-band gap semitransparent top 
cells. Although we have used a planar tandem 
architecture as a model in this study, the 1.67-eV, 
triple-halide perovskite composition also has 
an ideal band gap for textured perovskite/ 
silicon tandems (80), with potential for a J,. 
value of ~20 mA cm. Both CsPbBr,Cl3_» Derov- 
skites and Cs,,FA,MAj_»»»Pbl,Br3_. perovskites 
with tunable band gaps have been demonstra- 
ted by co-evaporation or two-step interdiffusion 
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diagram of electroluminescence (EL) measurement of a two-terminal tandem. 
(F) EL spectra of the tandem under different injection levels. The triple-halide 
perovskite exhibits a stable EL peak up to injection levels of 100 mA cm™. 

(G) The V,, contribution of each subcell estimated from EL quantum efficiency. 
The injection region of 17 to 20 mA cm” (near J, and Jimpp under 1-sun 
illumination) is highlighted in green. 


methods (65, 76, 81), so there is no fundamental 
barrier to realizing triple-halide perovskites 
using vapor deposition methods required for 
fully textured surfaces. A perovskite/Si tandem 
with a J. of ~20 mA cm™~ and the V,, and FF 
of triple-halide semitransparent cells presented 
here could have >30% PCE. 

The ability to further tune the band gap 
to higher energies shows promise for other 
multijunction architectures as well, such as 
perovskite/perovskite and perovskite/CIGS 
tandems. Beyond tandems, the demonstrated 
suppressed photoinduced phase segregation 
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and improved optoelectronic properties result 
in stable electroluminescence at high injection 
current densities and stabilized perovskite PL 
at 100-sun-equivalent illumination, highlight- 
ing opportunities for the use of this material in 
LED or concentrated PV applications. More 
broadly, the exploration of triple-halide perov- 
skites has identified a promising new region 
of perovskite single-phase stability and paves 
the way for another dimension of composi- 
tional engineering for perovskites. 
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A central master driver of psychosocial stress 


responses in the rat 


Naoya Kataoka, Yuta Shima, Keisuke Nakajima, Kazuhiro Nakamura* 


The mechanism by which psychological stress elicits various physiological responses is unknown. 

We discovered a central master neural pathway in rats that drives autonomic and behavioral stress 
responses by connecting the corticolimbic stress circuits to the hypothalamus. Psychosocial stress 
signals from emotion-related forebrain regions activated a VGLUT1-positive glutamatergic pathway from 
the dorsal peduncular cortex and dorsal tenia tecta (DP/DTT), an unexplored prefrontal cortical area, 
to the dorsomedial hypothalamus (DMH), a hypothalamic autonomic center. Genetic ablation and 
optogenetics revealed that the DP/DTT—-DMH pathway drives thermogenic, hyperthermic, and 
cardiovascular sympathetic responses to psychosocial stress without contributing to basal homeostasis. 
This pathway also mediates avoidance behavior from psychosocial stressors. Given the variety of stress 
responses driven by the DP/DTT—-DMH pathway, the DP/DTT can be a potential target for treating 


psychosomatic disorders. 


sychological stressors induce various 
physiological responses by stimulating 
sympathetic, neuroendocrine, and behav- 

ioral mechanisms. Stress-induced sym- 
pathetic stimulation of thermogenic 

and cardiovascular functions is common in 
mammals and is recognized as a stress-coping 
response to boost physical performance in 
fight-or-flight situations. However, excessive 
stress may cause aberrant sympathetic symp- 
toms such as psychogenic fever, a hyperther- 
mia caused by intense psychological stressors 
(2. Sympathetic thermogenesis in brown adi- 
pose tissue (BAT) contributes to stress-induced 
hyperthermia in rats and humans (2, 3). Re- 
sistance of psychogenic fever to antipyretics 
indicates an etiology different from that of 
infection-induced fever (7). Patients with post- 
traumatic stress disorder exhibit augmented 
cardiovascular responses to stress, which in- 
crease the risk of developing hypertension and 
cardiovascular diseases (4). Vigorous psycho- 
genic cardiovascular responses are also observed 
in panic disorder, a severe anxiety disorder (5). 
However, the central circuit mechanisms by 
which psychological stress activates the sympa- 
thetic nervous system are not well understood. 
The dorsomedial hypothalamus (DMH) is 
an important brain site for sympathetic stress 
responses (6, 7). A hypothalamo-medullary path- 
way from the DMH to sympathetic premotor 
neurons in the rostral medullary raphe region 
(rMR) mediates BAT thermogenic and cardio- 
vascular responses to psychological stress (2, 8). 
However, the mechanism by which the cortico- 
limbic circuits that process stress and emotion 
activate the sympathoexcitatory DMH neurons 
remains unknown. We sought to identify the 
forebrain neural pathways that transmit stress 
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signals to the DMH by using functional neuro- 
anatomy and in vivo physiological experiments 
combining genetic lesion and optogenetic ma- 
nipulations of a pathway of interest. We also 
employed social defeat stress (SDS), an animal 
model of psychosocial stress caused by inter- 
individual relationships that mimics human 
social stress (9). 


SDS activates DMH-projecting neurons 
in the mPFC 


To label neurons that project to the DMH, we 
injected cholera toxin b subunit (CTD), a ret- 
rograde neural tracer, into the rat DMH and 
consequently found many CTb-labeled neu- 
rons in the medial prefrontal cortex (mPFC) 
(Fig. 1, A to C). Subsequent exposure to SDS 
significantly increased expression of Fos, a 
marker for neuronal activation, in CTb-labeled 
neurons in specific mPFC subregions: the pre- 
limbic cortex (PrL), infralimbic cortex (IL), and 
dorsal peduncular cortex (DP) and the dorsal 
tenia tecta (DTT) immediately rostral to the 
callosal commissure (Fig. 1, D to F). These 
double-labeled cells were mostly distributed 
in layers V and VI and showed the typical mor- 
phology of glutamatergic pyramidal neurons 
(Fig. 1E). We did not find other forebrain re- 
gions containing a substantial double-labeled 
population after SDS. 


The DP/DTT mediates thermogenic and 
cardiovascular sympathetic responses 
to stress 


The PrL and IL, which constitute the main 
body of the mPFC (JO), exert inhibitory effects 
on stress responses (11, 12). Correspondingly, 
optogenetic stimulation of the IL—amygdala 
or PrL—bed nucleus pathway reduces freez- 
ing behavior in response to stress (13, 14). No 
literature is available on the function of the 
DP and DTT (hereafter, the DP/DTT). Therefore, 
we determined the roles of the stress-activated 
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mPFC subregions in driving sympathetic stress 
responses. 

SDS induced immediate increases in inter- 
scapular BAT temperature (7,7) and body core 
(abdominal) temperature (Tore) (Fig. 2), repre- 
senting psychosocial stress-induced BAT ther- 
mogenesis and hyperthermia, respectively (8). 
These responses were abolished by inactivation 
of DP/DTT neurons with bilateral nanoinjec- 
tions of muscimol: Responses were nearly sup- 
pressed by injections of 200 nl per site (Tpar: 
80% inhibition, T.,,¢: 88% inhibition; Fig. 2, A 
to F) and were moderately inhibited by in- 
jections of 100 nl per site (Tgay: 63% inhibi- 
tion, Teore: 54% inhibition; fig. S1). In contrast, 
muscimol injections of 200 nl per site into the 
IL resulted in significantly reduced inhibition 
of the responses (Tpar: 33% inhibition, Tyore! 
26% inhibition; Fig. 2, G to L) compared with 
injections into the DP/DTT (unpaired ¢ tests 
comparing inhibition percentage; Tpar: ts = 
3.16, P < 0.053 Teore: tg = 2.82, P < 0.05). SDS 
also induced increases in heart rate (HR) and 
mean arterial pressure (MAP), which were di- 
minished by injections of muscimol (100 nl 
per site) into the DP/DTT (HR: 53% inhibi- 
tion, MAP: 77% inhibition; Fig. 2, M to Q). 


DP/DTT neurons drive sympathetic responses 
through activation of the DMH—rMR pathway 


To test whether DP/DTT neurons transmit 
stress signals to the DMH to drive sympathetic 
responses, we performed in vivo electrophysio- 
logical experiments in anesthetized rats. Stim- 
ulation of DP/DTT neurons with nanoinjection 
of bicuculline robustly increased BAT sym- 
pathetic nerve activity (SNA), Tpar, HR, and 
MAP (fig. S2, A to E) (15). All of these responses 
were reversed by subsequent inactivation of 
DMH neurons with muscimol injections (fig. 
S2, A to E). 

We next investigated whether the stimulation 
of DP/DTT neurons activated the DMH-rMR 
pathway. CTb injection into the rMR (fig. S2F) 
retrogradely labeled many neurons that bilat- 
erally clustered in the dorsal part of the DMH 
(fig. S2, H and I); these include sympathoex- 
citatory glutamatergic neurons driving cold- 
defensive, febrile, and stress responses through 
their innervation of sympathetic premotor 
neurons in the rMR (8, 16-18). Subsequent 
stimulation of DP/DTT neurons with a uni- 
lateral bicuculline injection under free-moving 
conditions increased Fos expression in the CTb- 
labeled population in the DMH, mainly ipsi- 
laterally to the site of bicuculline injection (fig. 
82, G to J). 


Glutamatergic innervation of DMH neurons by 
DP/DTT neurons 


Next, we determined the neurotransmitter that 
mediates the DP/DTT—DMH transmission. 
Transduction of DP/DTT neurons with palGFP, 
amembrane-targeted form of green fluorescent 
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Fig. 1. SDS activates DMH-projecting neurons in the mPFC. (A) CTb injection 
into the DMH. 3V, third ventricle; f, fornix; mt, mammillothalamic tract; opt, optic 
tract. Scale bar, 500 um. (B) CTb injection sites in the rats for which data are 
plotted in (F). VMH, ventromedial hypothalamic nucleus. (©) CTb-labeled neurons 
(arrowheads) in the mPFC. fmi, forceps minor of the corpus callosum; LV, lateral 
ventricle. Scale bars, 500 um (main panel); 30 um (inset). (D) Distribution of 
CTb-labeled cells with or without Fos immunoreactivity in the mPFC after sham 
handling (Control) or SDS (Stress). ac, anterior commissure; Acb, nucleus 


protein (GFP) (19), using adeno-associated virus 
(AAV) (Fig. 3A) allowed us to visualize their 
numerous axons projecting to the DMH. We 
found close association of these axons with 
DMH neurons retrogradely labeled with CTb 
from the rMR (Fig. 3, B and C, and fig. S3A), 
indicating putative synapses formed by DP/DTT- 
derived axons on DMH-rMR projection 
neurons. Confocal imaging revealed that the 
swellings of DP/DTT-derived axons apposed 
to rMR-projecting DMH neurons contained 
VGLUTI (vesicular glutamate transporter 1), 
a marker for cortical glutamatergic neurons, 
but not VGLUT2, a subcortical glutamatergic 
neuron marker, or VGAT (vesicular GABA trans- 
porter), a GABAergic neuron marker (Fig. 3, D 
to F, and fig. S3B). 

Consistent with the view that glutamatergic 
inputs to the DMH mediate stress responses, 
blockade of glutamatergic synapses in the DMH 
with nanoinjections of a mixture of 2-amino-5- 
phosphonovaleric acid and cyanquixaline (AP5/ 
CNQX) inhibited SDS-induced sympathetic re- 
sponses for the first 20 min of stress exposure 
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(Tgar: 88% inhibition, T.o,¢: 80% inhibition, HR: 
56% inhibition, MAP: 41% inhibition; fig. $3, 
C to F), after which the antagonistic effects 
of the drugs waned. The difference in the in- 
hibitory effects of AP5/CNQX on the thermal 
and cardiovascular responses may be because 
the distribution of cardiovascular neurons in 
the DMH is broader than that of thermoreg- 
ulatory neurons (20). 


In vivo optogenetic stimulation of DP/ 
DTT—DMH glutamatergic transmission elicits 
sympathetic responses 


To directly examine the sympathoexcitatory role 
of the DP/DTT—DMH pathway, we selectively 
stimulated DP/DTT—-DMH monosynaptic trans- 
mission by using an in vivo optogenetic tech- 
nique (Fig. 3G). ChIEF, a channelrhodopsin 
variant (27), was used to induce membrane 
depolarization and action potentials in neurons 
with exposure to light. AAV transduction of 
DP/DTT neurons with ChIEF-tdTomato or 
palGFP resulted in the localization of the ex- 
pressed proteins in their cell bodies in the 
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accumbens; Cg, cingulate cortex; E, ependymal and subependymal layer; M2, 
secondary motor cortex; Nv, navicular nucleus; OV, olfactory ventricle. 

(E) Stress-induced Fos expression (blue-black) in CTb-labeled (brown) cells in 
the DP (arrows). Note the typical pyramidal neuron morphology. Scale bars, 
200 um (main panel); 30 wm (inset). (F) Percentages of Fos-immunoreactive 
cells in CTb-labeled populations in mPFC subregions (n = 4 rats per group). 

*P < 0.05; **P < 0.01; ***P < 0.001 (unpaired t test; PrL, te = 6.16; IL, te = 5.93; 
DP, te = 7.41; DTT, te = 2.47). Error bars indicate SEM. 


DP/DTT and nerve endings densely distributed 
in the DMH (Fig. 3, H and I, and fig. $4, A and B). 
Illumination of ChIEF-tdTomato-containing 
nerve endings, but not palGFP-containing 
nerves, in the DMH with pulsed blue laser 
light for 30 s consistently increased BAT SNA, 
Tgat, HR, and MAP in anesthetized rats (Fig. 
3, J and K, and fig. S4C). Longer illumination 
(180 s) elicited larger increases, and the photo- 
stimulated BAT thermogenesis was as intense 
as that induced by SDS (fig. S4, D and E, and 
table S1). Photostimulation of DP/DTT—DMH 
transmission activated the DMH—rMR path- 
way (fig. S4, G to I). Consistent with the view 
that DP/DTT—DMH transmission driving the 
responses is glutamatergic, antagonizing glu- 
tamate receptors in the DMH eliminated all 
sympathetic responses elicited by photostimu- 
lation of DP/DTT—DMH transmission (Fig. 3, 
Lto O, and fig. S4F). 

DP/DTT-derived nerves with ChIEF-tdTomato 
were also distributed in the lateral hypo- 
thalamic area (LH) (Fig. 31), which contains 
orexin neurons proposed to participate in 
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Fig. 2. Neurons in the DP/DTT, rather than the IL, mediate sympathetic stress 
responses. (A to Q) Effects of bilateral muscimol nanoinjections into the DP/DTT 
[(A) to (F) and (M) to (Q)] or the IL [(G) to (L)] on SDS-evoked BAT thermogenesis 
and hyperthermia [(A) to (L)] or cardiovascular responses [(M) to (Q)]. After 
saline or muscimol injections [1 mM; (A) to (L): 200 nl per site, (M) to (Q): 100 nl per 
site; marked by green microspheres in (A) and (G); scale bars, 500 um] into the 
DP/DTT [(A), (B), and (M)] or the IL [(G) and (H)] (the right side of the symmetric 
bilateral injections is shown), animals were exposed to SDS [gray zones in (C), 

(E), (I), (K), (N), and (P)]. All rats (n = 5 per group) underwent two stress trials, 
>1 week apart, with saline (first trial) and muscimol (second trial) injections at the 
same sites. Time-course changes in Tpat, Toore, HR, and MAP were analyzed by 
repeated measures two-way analysis of variance (ANOVA) [(C): injectant: F,.4 = 19.85, 
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P = 0.011, time: Fig0720 = 8.976, P < 0.001, interaction: Fig9720 = 8.68, P < 0.001; (E): 
injectant: Fy, = 36.89, P = 0.004, time: Figo720 = 10.93, P < 0.001, interaction: 
Figo720 = 10.38, P < 0.001; (1): injectant: Fy.4 = 22.5, P = 0.009, time: Figo729 = 25.47, 
P < 0.001, interaction: Fig9720 = 4.63, P < 0.001; (K): injectant: Fy,4 = 0.35, P = 
0.585, time: Figo720 = 12.64, P < 0.001, interaction: Figg9729 = 2.07, P < 0.001; (N): 
injectant: Fy,.4 = 27.99, P = 0.006, time: Figo720 = 15.54, P < 0.001, interaction: 
Figo720 = 2.48, P < 0.001; (P): injectant: Fy,4 = 55.59, P = 0.002, time: Figo720 = 11.16, 
P < 0.001, interaction: Fig9720 = 2.47, P < 0.001] followed by Bonferroni's post hoc 
test (red bars with asterisks indicate time points with significant differences). Area 
under the curve (AUC) during the stress period was analyzed by paired t test [(D): t4 = 
7.57. (F): tg = 5.93; (J): ta = 3.46; (L): ta = 139; (0): ty = 4.39; (Q): ty = 6.92]. *P < 
0.05; **P < 0.01; ***P < 0.001. Error bars indicate SEM. bpm, beats per minute. 
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Fig. 3. Optogenetic stimu- 
lation of DP/DTT—-DMH 
glutamatergic transmis- 
sion elicits sympathetic 
responses. (A and B) 
Injections of AAV-CMV- 
palGFP into the DP/DTT 
(A) and CTb into the rMR 
(B). The inset in (A) shows 
a palGFP-expressing neu- 
ron. py, pyramidal tract; 
RMg, raphe magnus nucleus; 
rRPa, rostral raphe pallidus 
nucleus. Scale bars, 500 um 
(main panels); 20 um 
(inset). (C) palGFP-labeled 
axons closely associated 
with a DMH neuron 
retrogradely labeled with 
CTb. Scale bar, 10 um. 

(D to F) Pseudocolored 
confocal images showing 
palGFP-labeled axon 
swellings (arrowheads) 
apposed to CTb-labeled cell 
bodies (asterisks) in the 
DMH. VGLUT1 (D) (filled 
arrowheads), but not 
VGLUT2 (E) or VGAT (F) 
(hollow arrowheads), was 
detected in these axon 
swellings. See also fig. S3. 
Scale bars, 5 um. (G) In 
vivo optogenetic stimulation 
of DP/DTT—+DMH nerve 
endings. (H and 1) DP/DTT 
neurons transduced with 
ChIEF-tdTomato (H) and 
their axons containing 
ChIEF-tdTomato in the DMH 
[(I); immunoperoxidase 
staining]. Scale bars, 
500 um. (J) Representative 
example of BAT thermo- 


genic and cardiovascular 
responses elicited by 
bilateral laser illumination of 
DP/DTT—>DMH nerve end- 
ings with ChIEF-tdTomato. 
AP, arterial pressure. 
(K) Changes in physiologi- 
cal variables induced by 
illumination of selective 
pathways (one-way ANOVA 
followed by Bonferroni's 
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post hoc test, n = 5 per group; ABAT SNA: F316 = 28.74, P < 0.001; ATgart: F3i¢ = 30.46, P < 0.001; AHR: F3i¢ = 17.27, P < 0.001; AMAP: F3ig = 8.29, P = 0.002). 
*P < 0.05; **P < 0.01; ***P < 0.001. See also fig. S4. (L) Experiment to test the effects of blockade of glutamatergic synapses in the DMH on physiological 
responses to photostimulation of DP/DTT—>DMH transmission. (M) Each circle indicates a site of saline and AP5/CNQX injections made at the same location in 
H of each rat (saline was always injected first). The right side of symmetric bilateral injections is shown. (N) AP5/CNQxX injections into the DMH eliminated 
sympathetic responses elicited by photostimulation of DP/DTT—+DMH nerve endings (compare with fig. S4F, which shows a result for the saline control). (0) Changes 
in physiological variables elicited by photostimulation of the DP/DTT—DMH pathway after saline or AP5/CNQX injections into the DMH (n = 5 per group). 


*P < 0.05; ***P < 0.001 (paired t test; ABAT SNA: ta = 13.99; AT gar: tq = 3.98; AHR: ty = 4.07; AMAP: ta = 2.86). Error bars indicate SEM. 
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sympathetic stress responses (22). However, 
photostimulation of DP/DTT—LH transmis- 
sion elicited no or subtle changes in the phys- 


Additionally, no response was elicited by photo- 
stimulation of IL—DMH axonal projections 
with ChIEF-tdTomato (Fig. 3K and fig. S4, 


DP/DTT—-DMH transmission is essential for 
sympathetic responses to psychosocial stress 
We next employed genetic approaches to deter- 


iological variables (Fig. 3K and fig. S4J). | K to M). mine the importance of the DP/DTT>DMH 
opalGFP_ @ palGFP AVG 
A B i Cc D @ faCaspS SteCasp3 AVG 
Cre-expressing 2.0; —palGFP 120 
AAV-Ef1a-flex DMH iecti 4.5} ~taCasp3 S 
-taCasp3-TEVp AAVrg-pgk-Cre ( -projecting) : £100) -- 0.96 oo G 
neurons Ge 10 9 gol P=0.01 ‘¢, 
© 0.5 © +o 
palGFP taCasp3 z 9 2 60 
O--. - : : RK 
DP/DTT f ore 4 Sy) < & 40 
j -0.5 a 
' IL -~1.0 < 20} g2* 
t " r Spinal ‘i ; aa | ee 3 : _ ! 
oo cord 4 fim --[fmi_ 135539030 60 00 120 «100200 300400500 
| era a a _ =) Time (min) Cre+ cell bodies in DP/DTT 
\ Bap ' A) 2.0; —palGFP 120 
canes ae ipo £100 
+ Cre eee, ie ate q a 1:0 6 80 ee) 
eat ’ mn 2 05 > 60 0.74 ogo 
v 1 Yia! ms 2 S 0.16 
“vy Dit: Ae “OTT fe 3 0 > 40 . é 
[RH Left > Pp [lacasps-Tevp >< | wPRE EEE in 13 bGH polyA ee ees oS) AN a Me © 
Vote ees We Loy “5 eee 
an com | -1.0 < 0 
Apoptosis 2 ; | 
nee 1555 -30 0 30 60 90 120 a 100 200 300 400 500 
Time (min) Cre+ cell bodies in DP/DTT 
AAV-Ef1a-DIO F 7 
-iChloC-mCherry AAVrg-pgk-Cre Optical fiber F paler HOI E SINeny 


> 3 weeks 


DP/DTT 


> Ausayoui-90]y9! K| bGH polyA 
+ Cre 
iChloC-mCherry <| | bGH polyA 
G Laser 
Laser ~~ 
250 * 


—palGFP 


iChloC-mCherry- 
expressing neuron 


—palGFP £1,500 


iChloC-mCherry 
Cre 


—palGFP 
—iChloC 


—iChloc £& = 
200 a eae 
= r aa 
E150 € 20 £1,000 
2 r= ‘= 
~=100 ~ 40 = 
¢ & S 
= 50 = olf z 500 
Ops 4 a 
Stress Stress < Stress 
-50 ———— -10 —— 3 0 0.5 ——— 
-60 -30 0 30 60 90 120 -60 -30 0 30 60 90 120 & -60 -30 0 30 60 90 120 
Time (min) Time (min) $ B Time (min) 


Fig. 4. Selective lesion or inhibition of the DP/DTT—-DMH pathway 
abolishes sympathetic stress responses. (A) Scheme for genetic lesions 
of DP/DTT—+DMH neurons. ITR, inverted terminal repeat; WPRE, woodchuck 
hepatitis virus posttranscriptional regulatory element; bGH polyA, bovine growth 
hormone polyadenylation signal. (B and C) Representative examples of palGFP 

or taCasp3 transduction of DP/DTT—+DMH neurons from a single rat. (B) Cre- 
immunopositive (DMH-projecting) neurons were eliminated exclusively in the DP/DTT 
after taCasp3 transduction. Arrowheads indicate deep layers of the DP/DTT. Scale 
bar, 500 um. (C) This lesioned rat failed to exhibit thermal responses to SDS 

(gray zone). (D) Relationship between the number of Cre-positive neurons remaining 
in the DP/DTT and stress responses (AUC of ATpar and ATeo-¢ during stress; see 
also time-course data in fig. S5A). Data from taCasp3 rats were subjected to linear 
egression analysis (Pearson's correlation test). The averages (AVG) of AUC were 
compared for the palGFP and taCasp3 groups (unpaired t test; n = 5 per group; ATpat: 
tg = 4.30; AT core: tg = 5.81). **P < 0.01, ***P < 0.001. Arrows indicate the cases 
plotted in (C). (E) In vivo optogenetic inhibition of DP/DTT—DMH neurons. Many 
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Cre-immunopositive (DMH-projecting) DP/DTT neurons expressed iChloC-mCherry. 
Scale bar, 20 um. (F) DP/DTT neurons were transduced with palGFP or iChloC- 
mCherry (top), and their axons with the proteins were densely distributed in the DMH 
(bottom; immunoperoxidase staining). Scale bars, 500 um (top); 20 um (bottom). 
(G) Illumination of DP/DTT—DMH nerve endings with iChloC-mCherry for the first 
10 min (blue zone) of SDS inhibited cardiovascular and hyperthermic stress responses. 
Increases in HR and MAP after SDS were caused by stress from returning to the home 
cage. Time-course changes in the variables were analyzed by repeated measures 
two-way ANOVA (n = 5 per group; AHR: group: Fig = 23.7, P = 0.001, time: Figois4o = 
23.55, P < 0.001, interaction: Figoia4o = 9.68, P < 0.001; AMAP: group: Fig = 5.98, 
P = 0.040, time: Figoiago = 11.82, P < 0.001, interaction: Figoia4o = 2.24, P < 0.001; 
AT core: Zroup: Fig = 9.69, P = 0.014, time: Figoi4ao = 31.83, P < 0.001, interaction: 
Figo1aao = 7.69, P < 0.001) followed by Bonferroni’s post hoc test (red bars with 
asterisks indicate time points with significant differences). The difference in AUC 
during the stress period was analyzed by unpaired t test (AHR: tg = 8.21; AMAP: tg = 
4.91; AT core: tg = 3.91). *P < 0.05; **P < 0.01; ***P < 0.001. Error bars indicate SEM. 
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monosynaptic pathway for driving sympathetic 
stress responses. First, DP/DTT—DMH pro- 
jection neurons were selectively lesioned. An 
AAV for Cre recombinase-dependent expression 
of taCasp3, which induces cell-autonomous 
apoptosis (23), and a retrograde AAV (AAVrg) 
for Cre expression were bilaterally injected 
into the DP/DTT and DMH, respectively, to 
selectively lesion double-infected neurons (i.e., 
DP/DTT—DMH projection neurons) (Fig. 4A). 
Figure 4B shows an example of lesions of 
DP/DTT—DMH neurons: taCasp3 expression 
eliminated 88% of Cre-expressing (i.e., DMH- 
projecting) DP/DTT neurons in one represent- 


Fig. 5. Photoinhibition of the DP/ A 
DTT—-DMH pathway abolishes stress- 
induced avoidance behavior and skin 
vasoconstriction. (A) Wistar rats in 

which DP/DTT—DMH neurons were 
transduced with iChloC-mCherry or 

palGFP (Fig. 4E) underwent a social 
interaction test after SDS. (B) Repre- 


sentative examples of behavior mk ied 


tracking of palGFP- and iChloC- 
mCherry-transduced rats during the B 
social interaction test. (C) Behaviors 
during the social interaction test (n = 5 
per group). Naive rats did not undergo 
SDS before habituation and social inter- 
action test. Data were analyzed by one- 
way ANOVA (Entries: F524 = 12.87, P < 
0.001; Time spent: F524 = 15.49, 

P < 0.001; Moving distance: F524 = 9.15, 
P < 0.001) followed by Bonferroni's 
post hoc test. ns, not significant; 

**P < 0.01; ***P < 0.001. Error 

bars indicate SEM. (D and E) Thermo- 
graphic measurements at proximal and 
distal parts [arrowheads in (D)] on the 
tail skin (n = 5 per group) were analyzed 
by repeated measures two-way ANOVA 
(Proximal: treatment: Fy.4 = 17.44, P = 
0.014; group: Fi.4 = 2.11, P = 0.220; 
interaction: F,,4 = 9.01, P = 0.040; Distal: 
treatment: Fy,4 = 12.13, P = 0.025; group: 
Fi4 = 10.74, P = 0.031; interaction: 

Fig = 11.36, P = 0.028) followed by 
Bonferroni's post hoc test. ns, not signif- 
icant; *P < 0.05 (versus habituation). 
Error bars indicate SEM. (F) Model of the 
central psychosomatic pathways that 
drive stress responses. The DP/DTT 
integrates signals from multiple forebrain 
regions processing stress and emotion 
and then provides a glutamatergic (Glu) 
master signal to the DMH to excite 
neuronal groups controlling different 
effectors. The excited DMH neurons drive 
sympathetic outflows to BAT, the heart, 
and cutaneous and visceral blood vessels 


~] 
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ative rat as compared with palGFP-transduced 
control rats. This lesioned rat failed to exhibit 
BAT thermogenic and hyperthermic responses 
to SDS (Fig. 4C). Group data (Fig. 4D and fig. 
S5A) showed that stress-induced BAT thermo- 
genesis was inhibited in proportion to the 
extent of lesions of DP/DTT—DMH neurons, 
and the stress response was totally eliminated 
when all the neurons were lesioned. However, 
the lesions of DP/DTT—-DMH neurons did not 
affect diurnal control of Tar or Teore (fig. S5B). 

To exclude the possible influence of long- 
term alterations in the circuits after neuronal le- 
sions, we optogenetically suppressed DP/DTT— 
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DMH transmission in free-moving rats. The 
Cre-dependent expression system with an- 
terograde and retrograde AAVs was used to 
selectively transduce DP/DTT—DMH projection 
neurons with iChloC, a chloride-conducting 
channelrhodopsin shown to photoinhibit neu- 
ronal activity (24) (Fig. 4E). Expressed iChloC- 
mCherry proteins (or palGFP for control) were 
bilaterally detected in cell bodies in the 
DP/DITT and their nerve endings in the DMH 
(Fig. 4F). In vivo bilateral illumination of iChloC- 
mCherry-containing nerve endings in the DMH 
with pulsed blue light exerted potent, long- 
lasting inhibition on SDS-induced increases 
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through the rMR and partly through the rostral ventrolateral medulla (RVLM). Another subset of DMH neurons drives avoidance behavior through as-yet-unknown circuit 
mechanisms. AIP, posterior part of the agranular insular cortex; Pir2, layer Il of the piriform cortex. 
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in HR, MAP, and Tore (HR: 63% inhibition, 
MAP: 52% inhibition, Tyore: 66% inhibition; 
Fig. 4G) as compared with illumination of 
palGFP-expressing nerves. Commensurate 
with these inhibitions, iChloC-mCherry was 
expressed in 70 + 2% of Cre-expressing (DMH- 
projecting) DP/DTT neurons (counted in three 
rats in which Cre immunostaining was successful; 
Fig. 4E). Photoinhibition of DP/DTT—DMH 
transmission did not reduce HR, MAP, or Toore 
below baseline levels, which indicates that this 
pathway is not involved in basal maintenance 
of these parameters. 

Photoinhibition of DP/DTT—DMH trans- 
mission also suppressed BAT thermogenesis 
evoked by cage-exchange stress, an anticipa- 
tory anxiety model, in males and females (fig. 
S6), indicating that DP/DTT—DMH transmis- 
sion mediates sympathetic responses to broader 
types of psychological stressors in both sexes. 
In contrast, photoinhibition of IL—DMH trans- 
mission did not impede SDS-induced stress 
responses (fig. S7). 


The DP/DTT—-DMH pathway mediates stress 
behaviors and skin vasoconstriction 


Because the DP/DTT—DMH pathway is es- 
sential for driving the repertoire of sympa- 
thetic stress responses, we sought to examine 
its involvement in stress behavior. We thus 
investigated the effect of optogenetic inhibition 
of DP/DTT—DMH transmission on avoidance 
behavior from psychosocial stressors (25). A 
male Wistar rat in which DP/DTT—DMH neu- 
rons were selectively transduced with iChloC- 
mCherry or palGFP (Fig. 4E) was subjected to 
SDS and moved to an open field for habituation 
(Fig. 5A). Subsequently, the dominant male 
Long-Evans rat used in the SDS episode was 
caged and placed in the open field (Fig. 5A). 
All of the stressed Wistar rats that did not 
undergo illumination exhibited avoidance 
behavior by staying away from the social in- 
teraction zone surrounding the cage of the 
Long-Evans rat (Fig. 5, B and C). By contrast, 
naive Wistar rats, which had not experienced 
SDS, exhibited active social interactions (Fig. 
5C). The stress-induced avoidance behavior was 
suppressed by photoinhibition of DP/DTT— 
DMH transmission in iChloC-mCherry rats, 
and their social interactions were restored 
to the level exhibited by naive rats, whereas 
illumination in palGFP rats had no effect (Fig. 
5, B and C). Illumination administered when 
iChloC-mCherry rats were lying down in a 
relaxed state in the open field without the 
presence of a Long-Evans rat did not elicit 
locomotion (trials in four rats, five illumina- 
tion episodes per rat). Therefore, it is unlikely 
that the behavioral effect of photoinhibition 
resulted from stimulation of any motivation 
or vigilance circuit system. In addition, photo- 
inhibition did not alter social interactions ex- 
hibited by naive rats (naive ON; Fig. 5C). 
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Thermography revealed reduction of tail skin 
temperature in palGFP rats during the social 
interaction test (with illumination) compared 
with temperature during habituation (Fig. 5, D 
and E). This outcome represents stress-induced 
cutaneous vasoconstriction, a sympathetic re- 
sponse that contributes to stress-induced hy- 
perthermia by reducing heat loss (26). Tail skin 
vasoconstriction was absent in iChloC-mCherry 
rats with photoinhibition of DP/DTT—-DMH 
transmission (Fig. 5, D and E). 


The DP/DTT receives stress-driven inputs from 
thalamic and cortical regions 


To explore the forebrain regions that provide 
stress inputs to the DP/DTT, we combined 
retrograde tracing from the DP/DTT with 
detection of stress-induced Fos expression. Rats 
that received a CTb injection into the DP/DTT 
(fig. S8, A and B) underwent SDS or sham 
handling. Substantial numbers of neurons were 
labeled with CTb in several regions of the thal- 
amus, insular and piriform cortices, and amyg- 
dala. SDS significantly increased Fos expression 
in CTb-labeled neurons in the mediodorsal (MD) 
and paraventricular (PVT) thalamic nuclei, the 
posterior part of the agranular insular cortex, 
and layer II of the piriform cortex (fig. S8, C to 
F). This observation suggests that the DP/DTT 
receives and integrates stress-driven inputs 
from these thalamic and cortical regions and 
provides the integrated signals to the DMH 
to drive sympathetic and behavioral stress re- 
sponses (Fig. 5F). 


Discussion 


By using the rodent model of psychosocial 
stress, we discovered a prefrontal cortex- 
hypothalamus excitatory pathway that drives 
sympathetic and behavioral stress responses, 
in which the DP/DTT was crucial for stress sig- 
naling to the DMH. Our anatomical, physiolog- 
ical, and optogenetic experiments revealed that 
VGLUT1-positive pyramidal neurons in the 
DP/DTT transmit psychological stress-driven 
glutamatergic signals to the DMH to elicit a va- 
riety of stress responses. Most notably, selective 
ablation or inhibition of the DP/DTT—DMH 
monosynaptic pathway abolished BAT ther- 
mogenic, skin vasoconstrictor, cardiovascular, 
and behavioral responses to SDS without af- 
fecting basal thermoregulatory or cardiovascular 
homeostasis. Our present findings demonstrate 
that the DP/DTT—DMH excitatory transmis- 
sion of psychological stress signals is a master 
driver of the wide range of sympathetic and 
behavioral stress responses (Fig. 5F). 

The DP/DTT is an unexplored brain area 
located at the ventral limit of the mPFC. In 
contrast, the PrL and IL have been a central 
focus of stress research and have been shown 
to provide signals to inhibit stress responses. 
In this study, we discovered a group of neurons 
that drive stress responses in the DP/DTT. 
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Photostimulation of DP/DTT—DMH trans- 
mission, but not that of IL—DMH, elicited 
sympathetic responses that mimic stress re- 
sponses. Photoinhibition of DP/DTT—DMH 
transmission, but not that of IL>DMH, sup- 
pressed sympathetic stress responses. Also, in- 
activation of DP/DTT neurons suppressed stress 
responses, in contrast to subtle effects of in- 
activation of IL neurons. Thus, we propose that 
there are two functional units in the mPFC: the 
ventral (DP/DTT) unit that drives stress re- 
sponses and the dorsal (PrL/IL) unit that in- 
hibits these responses. The inhibitory unit may 
constitute the negative feedback mechanism in 
which the stress hormones, glucocorticoids, 
act in the PrL and IL to mitigate or terminate 
stress responses (17). This feedback inhibition 
might involve stress-activated PrL/IL—DMH 
neurons (Fig. 1) and/or local inhibition of 
DP/DTT neurons from the PrL/IL. 

DP/DTT—DMH stress signaling activates 
DMH-rMR sympathoexcitatory neurons to 
drive stress responses (Fig. 5F). The DMH—rMR 
pathway also serves as the trunk pathway that 
controls body temperature and develops in- 
flammatory fever, whose activity level is con- 
tinuously controlled by descending inputs from 
the thermoregulatory center, the preoptic area 
(27). Because the DP/DTT—DMH pathway 
does not contribute to basal thermoregulation 
or cardiovascular control, the stress-driven ad 
hoc inputs from the DP/DTT are likely inte- 
grated in the DMH with the homeostatic tonic 
inputs from the preoptic area by impinging on 
the DMH—rMkR efferent neurons. In addition, 
stress-induced visceral and cutaneous vasocon- 
striction may also be mediated in part by DMH 
neurons innervating the rostral ventrolateral 
medulla (28, 29) (Fig. 5F). 

The DP/DTT—DMH pathway appears to 
constitute a key psychosomatic connection 
through which stress and emotions affect the 
autonomic and behavioral motor systems. Al- 
though the corticolimbic circuits that process 
stress and emotions are undetermined, the PVT 
and MD thalamic nuclei, which provide stress 
inputs to the DP/DTT, constitute a fear stress 
circuit involving the amygdala (30, 31). Thus, 
the stress and emotion signals processed by 
forebrain circuits are likely integrated at the 
DP/DTT and then transmitted to the DMH. 
In panic disorder, glutamatergic inputs to the 
DMH to develop the panic-prone state (32) may 
be provided from the DP/DTT. The DP/DTT—> 
DMH pathway is a potential target for treating 
psychosomatic disorders that involve aberrant 
physiological responses, particularly because 
this pathway does not contribute to basal 
autonomic homeostasis. 
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A neurodevelopmental origin of behavioral 
individuality in the Drosophila visual system 
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Mercedes Bengochea’, Liz Hellbruegge~*, Guangda Liu*®, Radoslaw K. Ejsmont?*, Andrew D. Straw®, 
Mathias Wernet?, Peter Robin Hiesinger??, Bassem A. Hassan?2>+ 


The genome versus experience dichotomy has dominated understanding of behavioral individuality. 

By contrast, the role of nonheritable noise during brain development in behavioral variation is 
understudied. Using Drosophila melanogaster, we demonstrate a link between stochastic variation in 
brain wiring and behavioral individuality. A visual system circuit called the dorsal cluster neurons (DCN) 
shows nonheritable, interindividual variation in right/left wiring asymmetry and controls object 
orientation in freely walking flies. We show that DCN wiring asymmetry instructs an individual's 

object responses: The greater the asymmetry, the better the individual orients toward a visual object. 
Silencing DCNs abolishes correlations between anatomy and behavior, whereas inducing DCN asymmetry 


suffices to improve object responses. 


ndividual variability in morphology is abun- 

dant, including among human identical 

twins and species that reproduce by par- 

thenogenesis (J, 2). In this regard, the 

brain is no exception. Examples of indi- 
vidual brain variation include differences of 
size, weight (3), and neuroanatomical parcel- 
lations of human brains (4, 5). In invertebrates, 
where individual neurons can be identified 
across animals, single neurons show variabil- 
ity in morphology, wiring (6), synaptic con- 
nectivity, and molecular composition across 
individuals (7-9). 

Similarly, innate behaviors, such as selective 
attention to stimuli, show individual variation 
even among genetically identical individuals 
(10-13). The stability of individual differences 
over time defines behavioral idiosyncrasies as 
animal individuality (74). It has been proposed 
that variability in innate behavior is due to 
neuromodulation of anatomically hardwired 
circuits (15-17). By contrast, there is evidence 
for developmental plasticity resulting in a 
range of possible circuit diagrams among in- 
dividuals (18, 19), but whether nonheritable 
individual anatomical differences in brain wiring 
can predict distinct behavioral outcomes is 
unexplored (20-23). 

To test whether stochastic wiring of neural 
circuits affects behavioral variation, we used 
Drosophila contralateral visual interneurons 
called the dorsal cluster neurons (DCNs) (24) 
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(also known as LC14) (25). DCNs exhibit up to 
30% wiring variability of their axonal projec- 
tions between individuals and between the 
left and right hemispheres of the same brain 
(26). DCN axons innervate two alternative 
target areas in the fly visual system called the 
medulla (M-DCNs) and the lobula (L-DCNs) 
(24). The decision whether any given DCN be- 
comes a M-DCN or L-DCN is determined by 
an intrinsically stochastic lateral inhibition 
mechanism mediated by the Notch signaling 
pathway (78). To test the link between wiring 
variation and behavioral variation, we used a 
visual behavioral assay called Buridan’s para- 
digm (27). In this assay, a fly is placed between 
two identical high-contrast stripes at 180° from 
each other in a uniformly illuminated arena 
(28). The stripes are unreachable, inducing 
the fly to walk back and forth between them 
during the assay. 

Here we report that flies show behavioral 
individuality that is nonheritable and is not 
reduced through inbreeding. We find that 
the degree in left-right DCN wiring asym- 
metry in the medulla is a predictor of behav- 
ioral performance of individual flies. The 
more asymmetric the DCN medulla inner- 
vation is, the narrower the path a fly walks 
between the two stripes. DCN activity is nec- 
essary for this correlation, and reengineering 
DCN asymmetry suffices to change an indi- 
vidual’s behavior. 


Results 


While analyzing object orientation responses 
in wild-type Canton S (CS) flies (Fig. 1A and 
movies S1 to S3), we noted sex-independent 
interindividual variability in their trajectories 
(Fig. 1, B and C). We focused on a parameter 
called absolute stripe deviation (henceforth 
aSD), measuring the deviation from the nar- 
rowest possible path between the stripes. Al- 
though males tend to walk narrower paths, 
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the degree of interindividual variation in aSD 
is the same between males and females (Fig. 
1D). We therefore continued our studies with 
combined populations (Fig. 1E). 


Object orientation variability is independent of 
genetic diversity 


To test whether behavioral variability cor- 
relates with genetic diversity, we screened a 
subset (N = 10) of the Drosophila genomic 
reference panel (DGRP) (29) for genetically 
homogeneous strains with extreme object 
orientation responses. This identified two 
strains with opposing behavioral phenotypes: 
DGRP-639 showed low aSD (Fig. 1, F and G), 
whereas DGRP-859 showed high aSD (Fig. 1, 
H and I). Similar behavioral differences were 
found in seven other representative behav- 
ioral parameters (fig. S1). However, despite 
the extreme reduction of genetic diversity, 
the degree of individual variation in aSD was 
not reduced (Fig. 1, G and I). On the contrary, 
DGRP-639 showed increased behavioral var- 
iability (Fig. 1G and fig. S1), hinting at the 
nonheritability of this variability. 


Individual object orientation responses 
are nonheritable 


If the genotype of an individual determines its 
behavior, repeated breeding of parental ani- 
mals with a specific behavioral trait should 
select for a specific behavior, creating a be- 
haviorally homogeneous population. We mated 
three pairs with the lowest and highest aSD 
scores, respectively (Fig. 2, A and B), and object 
orientation responses were measured in their 
offspring (Fig. 2, C and D). We found no dif- 
ferences between the two sets of offspring in 
aSD scores as well as six other parameters 
tested (fig. S2A). The same was true for the 
offspring of a single pair with low and high 
aSD (fig. S2, B and C). We repeated the same 
breeding schemes with the near-isogenic 
DGRP-639 and DGRP-859 for seven gener- 
ations. We found that for most parameters, 
a breeding pair reproduces the full range of 
variability in the population at every gener- 
ation (figs. S3 and S4). 


Individual object orientation responses are 
stable over time 


An individual’s idiosyncratic behavioral pro- 
file may not be heritable either because it is 
driven by internal-state modulations, or be- 
cause it is driven by nonheritable develop- 
mental mechanisms. To distinguish these 
possibilities, we first tested the same indi- 
vidual CS flies once every other day for 3 days 
and found that an individual’s behavior 
was virtually identical over the three trials 
(Fig. 3, A and B). Statistical analysis of aSD 
showed that the individual responses of CS 
flies on different days were correlated (7 = 
0.74 to 0.77, Fig. 3E). The same was true for 
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path details like left- or right shifted angles | 4-week period. We found that the object re- 
(Fig. 3A and fig. S5C), distance, full walks, me- | sponses of individuals were stable over this 
ander, absolute horizon deviation, absolute | extended period (Fig. 3, C and D, and fig. 


angle deviation, angle deviation, 


and center | S6, A and B). This stability argued against 


deviation. We extended this analysis over a | state modulations and in favor of individ- 
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Fig. 1. Individual variation of Drosophila stripe responses is independent of 
gender or genetic diversity. (A) Drosophila object orientation responses are 
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ual properties. Indeed, starvation followed 
by refeeding over a period of 3 days failed 
to reduce stability of individual performances 
despite obvious changes in mean population 
behavior (fig. S7). Finally, we asked whether 
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shows the cumulative aSD for CS male and female flies (N = 98). (F) DGRP- 
639 flies (N = 61) showed on the population level an object orientation 
na. (B) Male CS wildtype flies (N = 50) response toward the stripes that exceeds even the response of CS (Tukey 


showed on the population level (shown in a heatmap) an object orientation test, p = 0.01). Three examples of individual responses show the individual 


response toward the stripes that are located at the top and bottom. The 


differences. (G) The histogram for aSD shows that DGRP-639 flies (N = 61) 


categorization of individual responses into strong and weak object orientation exceed the variability of CS flies (F-test, p < 0.001). The distribution is shifted 


responses shows the entire repertoire 
tracks). (€) Female CS wild-type flies 

orientation responses as their male co 
show that CS male (N = 50) and fem 


range of individual responses. The histogram shows in bins of 5 % of the 
radius the distribution of aSD for each population. The distributions for males 


and females are statistically identical 


unterparts. (D) The histograms for aSD 


of responses (shown as individual fly toward lower aSDs. (H) DGRP-859 flies (N = 59) showed on the population 
N = 48) showed the same object level (heatmap) a weak object orientation response toward the stripes. The 
main population response is edge behavior. Two examples of individual 

ale (N = 48) flies displayed the same responses show prevalent individual differences that include also individuals with 
strong object orientation responses. (I) The histograms for aSD indicate that 
DGRP-859 flies (N = 59) show variability comparable to that of CS flies. The 


(Tukey test, p = 0.1). (E) The histogram distribution is shifted toward higher aSDs. 
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Selection of 3 pairs of animals with lowest and highest stripe deviation from CS 
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Fig. 2. Individual variation is independent of genetic selection. 

(A and B) The three lowest- (A) and highest-scoring couples (B) for aSD were 
chosen from a population of 47 CS males and 37 virgin CS females. The 
heatmaps in the top row show, from left to right: (i) the three virgin females 
and (ii) males with the lowest aSD, (iii) the three virgin females, and (iv) 
males with the highest aSD. (C and D) The offspring of these two populations 


reduced genetic diversity affects behavioral 
stability. We performed repeated testing of 
DGRP-639 and DGRP-859 individual flies and 
found that both inbred strains showed tem- 
porally stable individual responses (Fig. 3, F 
and G, and fig. S5, A to C). 

Together, the data show that individual 
variability in object orientation is a non- 
heritable, temporally stable trait that is 
independent of sex, genetic background, 
and genetic diversity. Where in the brain 
might such individuality in visual behav- 
ior originate? 


A variable set of commissural 
visual interneurons 


In 1982 Bilthoff (30) suggested, based on 
work by Zimmermann (37) and Gotz (32), 
that object position processing in Drosophila 
(33) requires qualitative asymmetry of the 
visual percept of an object. However, direct 
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both populations. 


evidence for this notion is lacking, especially 
that the sizes of the left and right eyes of the 
same fly are highly correlated (34). In 1986, 
while analyzing object responses in motion- 
blind flies, Heisenberg and colleagues sug- 
gested that binocular interactions, through 
higher-order commissural visual interneur- 
ons, are required for object orientation (35). 
Putting the two predictions together we hy- 
pothesized that variation in object orienta- 
tion responses is regulated by the variation in 
the asymmetry of a higher-order contralateral 
visual circuit innervating the frontal visual 
field. The DCNs match this predicted circuit 
(Fig. 4A). 

To obtain a comprehensive description of 
DCN wiring, we extended the previous analyses 
of DCNs that were based on 16 female flies (18), 
to 103 males and females. We found that the 
number of DCNs varied from 22 to 68 cells, 
with a range of 11 to 55 L-DCNs and 6 to 23 M- 
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are shown in the middle row separated by a black stripe. The behavioral 
heatmaps and variability histograms of the two populations of offspring 
are statistically indistinguishable (N = 180 for both; two-way analysis of 
variance (ANOVA) and Tukey HSD as post hoc test; p = 0.22). The bottom 
row shows examples of individuals representing the range of variability in 


DCNs (Fig. 4B and fig. S8, A and B). In addi- 
tion, we observed a distribution of variation 
in medulla-targeting asymmetry by M-DCNs 
(Fig. 4C, histogram distributions; fig. S8B). 
The distribution of all DCN asymmetries 
showed a peak of low asymmetries, although 
extreme asymmetries were present but rare. 
Finally, three-dimensional reconstruction 
showed that M-DCN axons terminate in the 
posterior medulla (movies S4 to S6), where 
visual columns from the frontal visual field 
are located, and the DCN wiring pattern in 
the medulla does not change in the adult (fig. 
S9 and movies S7 and S8). 


Individual wiring variability drives 

behavioral individuality 

DCNs represent an ideal candidate for an in- 
trinsically asymmetric population of contra- 
lateral higher-order interneurons to mediate 
object responses (35). To test this hypothesis, 


3 of 8 


RESEARCH | RESEARCH ARTICLE 


A CS repeated individual testing - strong stripe response 


CS long term stability - strong stripe response 


aint 
cat 
lf 


Wy 
ML 
in 
fe 
qi 
f 


Individual 1 Week 1&4 


[ day? |iasa 


CS repeated individual testing - weak stripe response 


[featmap | Day? [_Day2 | Dana) 
m y om™ es 


CS long term stability - weak stripe response 


[wets] bay? IL bay [bana 


Individual 2 Week 1&4 


Fig. 3. Individual variation of Drosophila object orientation responses is 
stable over time. Adult CS flies (N = 74) were repeatedly tested over the 
duration of 3 days and several weeks. The flies showed marked stability in their 
responses irrespective of whether they responded strongly or weakly to the 
visual cue, or even showed no response toward the visual cue at all. (A) Two 
different examples of CS flies with low aSD. The heatmap and the individual 
tracks for the 3 days show persistent behavior. Even the angle toward the stripes 
is conserved throughout the 3 days. The upper row is right shifted, and the 
bottom row is left shifted. (B) Two different examples of CS flies with high aSD. 
The positional preferences are conserved between the different days. The upper 
row shows an animal with local preferences and the lower one performs a 
random walk. (C and D) Two different examples of CS flies with strong (individual 1) 
and high aSD (individual 2) that show persisting behavior throughout three 
consecutive days and several weeks. (E) Statistical analysis for aSD shows that 
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the CS responses of the different days and weeks are correlated. The Pearson 
correlation coefficient for day 1 versus day 2 is 0.76 with p < 0.001. For day 

1 versus day 3, the correlation coefficient is 0.76 with p < 0.001. For day 2 versus 
day 3, the correlation coefficient is 0.76 with p < 0.001. For week 1 versus week 4, 
the correlation coefficient is 0.66 with p < 0.001. (F) Similar to the CS data, 
the responses for DGRP-639 were moderately to strongly correlated (N = 52, animals 
with low path length were removed; see fig. S3A for examples). The Pearson 
correlation coefficient for day 1 versus day 2 is 0.39 with p = 0.0047. For day 
1 versus day 3, the correlation coefficient is 0.63 with p < 0.001. For day 

2 versus day 3, the correlation coefficient is 0.59 with p < 0.001. (G) The 
correlation between days for DGRP-859 (N = 76) even exceeds the data for 

CS. The Pearson correlation coefficient for day 1 versus day 2 is 0.76 with 

p < 0.001. For day 1 versus day 3, the correlation coefficient is 0.64 with p < 0.001. 
For day 2 versus day 3, the correlation coefficient is 0.82 with p < 0.001. 
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we first asked whether the DCNs were re- 
quired for object orientation. Inactivating ei- 
ther all DCNs or only M-DCNs resulted in a 
strong increase in aSD (Fig. 4, D and E, and 
fig. S8, C and D). Next, we queried the rela- 


tionship between individual variability in 
object orientation behavior and individual 
variability in DCN wiring (NV = 103) (fig. S10). 
Unbiased correlation analysis between 36 be- 
havioral parameters and 37 prominent DCN 


anatomical features (Fig. 5A and fig. S11, A to 
C) showed that left-right asymmetry in M-DCN 
innervation correlated with an individual’s aSD 
(Fig. 5C, 7 = -0.67) and other interdependent 
parameters (fig. S11A). Individuals with high 
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Fig. 4. Normal stripe responses require DCN function. (A) The DCNs are 
commissural neurons in the visual system of the fly. The DCNs have dorsally 
ocated cell bodies that send out an ipsilateral dendrite and a contralateral axon. 
This axon innervates either the visual neuropil lobula or medulla. Two independent 
driver lines are shown for the DCN neurons: ato-lexA (red, lexAOP-myr-td Tomato) HSD as post hoc test, p < 0.001). Higher aSD means that the animals fixate the 
stripes less. (E) Similar results are obtained by M-DCN neuron silencing with 
VT037804-GAL4. The heatmap of the control population of VT037804-GAL4/+ 
variability in their axonal branching pattern, as shown for three individual brains. _ flies shows normal object orientation in the two-stripe arena. This is lost upon 
anges silencing of DCN neurons in VT037804>Kir2.1 animals. Statistical analysis 

(N = 72) of aSD shows that VT037804>Kir2.1 animals show higher aSD than the 
controls (p < 0.001). Scale bars, 20 ym. 
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marks all DCNs whereas VT037804-GAL4 (green, UAS-myr-GFP) 
-DCN neurons that innervate the medulla. (B and C) The DCNs display high 


Statistical analysis shows that the number of medulla axon branches 
rom 6 to 23 axons with a mean of 13.99 (B). The medulla asymmetry ranges from 
0 to 10 axons with a mean of 2.98 (C). (D) DCN neuron silencing leads to an 
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ncrease in aSD. The heatmap of the control population of ato-GAL4/+ flies shows a 
normal response in the two-stripe arena. This is lost upon silencing of DCN neurons 
n ato>Kir2.1 animals. Statistical analysis (N = 57 to 63) of aSD shows that 

ato>Kir2.1 animals show higher aSD than the controls (two-way ANOVA and Tukey 
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M-DCN asymmetry have a low aSD, whereas 
individuals with symmetric M-DCN have a 
high aSD (Fig. 5, B and C). To test if DCN 
wiring asymmetry is a functional driver of 
individual object orientation behavior, we si- 


Fig. 5. Individual 
variation of anatomi- 
cal brain asymmetry 
regulates behavioral 
variation. (A) To 
correlate behavioral 
variation with brain 
anatomical variation, 
flies were tested 
behaviorally in the 
Buridan arena, their 
brains dissected and 
stained, and DCN 
anatomy imaged, 
keeping individual 
information intact. 
After computational 
analysis of fly tracks 
and automated 
neuronal reconstruc- 
tion of the brain 
anatomy, a cross- 
correlational analysis of 
both data sets was 
performed. (B) Animals 
with high left-right 
asymmetry of DCN 
axonal projections 
showed better 
performance than 
animals with lower 
asymmetry. The 
colored numbers show 
the reconstructed 
medulla axons in each 
brain hemisphere. 
(C) Statistical analysis 
(N = 103) shows that 
the medulla asymmetry 
correlates (r = 0.67, 

p < 0.001) with aSD. 
The blue and orange 
crosses mark the posi- 
tion of the blue and 
orange individuals 
shown in (B). (D) DCN 
silencing with tetanus 
toxin results in the loss 
of a correlation 
between medulla right- 
left asymmetry and 
aSD. High-asymmetry 
and low-asymmetry 


lenced DCNs and repeated the analyses. This 
abolished the correlation between M-DCN 
asymmetry and aSD, but not stripe detection 
per se (Fig. 5, D and E, 7 = —0.002 and figs. 
$12 and S13). 


DCN asymmetry determines object orientation 
in individuals 

Our data show that nonheritable develop- 
mental variation in DCN wiring asymmetry 
is necessary for creating variability in object 
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animals show approximately the same behavioral performance. (E) Statistical analysis shows that DCN silencing with tetanus toxin (TNT, N = 92) results in the 
loss of anatomy behavior correlations (r = —0.002, p = 0.98; control: N = 89, r = -0.54, p < 0.001, shaded dark gray). The examples of (D) are marked with a blue and 
orange cross in the correlation plot. Scale bars, 20 um. 
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Fig. 6. Individual variation of 
anatomical brain asymmetry 
suffices for behavioral variation. 
(A) Schematic for the develop- 
mental inhibition of endocytosis 
using UAS-shibire'S during the 
critical period (24 to 48 hours after 
puparium formation). The manipu- 
ation to change asymmetry is 
performed at the time of the choice 
between M-DCN or L-DCN (18). 
(B) Blocking endocytosis during 
DCN development increased 

he proportion of individuals with 
symmetric wiring and correspond- 
ingly aSD, with no effect on 

the correlation between anatomy 
and behavior in single flies. 

(C) Correlation curve of the 
UAS-shibire's individuals described 
in (B) (N = 27, r = -0.69, p < 0.001; 
control: N = 25, r = -0.75, 

p < 0.001, shaded dark gray). 

(D) Directed induction of one-sided 
clones in M-DCNs expressing 
Kir2.1 results in stronger object 
orientation responses than in 

the genetic control [same 
genotype, no heat shock (HS) 
clonal induction, p = 0.007] and 
in the HS control (identica 
genotype but lacking UAS-Kir.2.1, 
p = 0.006). Data were analyzed by 
two-way ANOVA and Tukey test. 
Scale bars, 20 um. 
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orientation behavior across individuals. We 
therefore wondered if the changed object 
orientation responses in the DGRP strains 
reflect DCN asymmetry alterations. We found 
that the low aSD strain DGRP-639 displayed 
more DCN wiring asymmetry, and the high aSD 
strain DGRP-859 less DCN wiring asymmetry 
(fig. S14), consistent with our hypothesis. Next, 
we developmentally rewired the DCNs either 
by blocking endocytosis to inhibit develop- 
mental signaling among DCNs or by activating 
the Notch pathway, both in a DCN-specific 
fashion. This resulted in reduced DCN wiring 
asymmetry and a correspondingly higher aSD, 
while preserving the correlation between wiring 
and behavior (Fig. 6, A to C, and figs. S15 and 
S16). Finally, we genetically engineered flies 
to generate one-sided DCN clones expressing 
the neuronal silencer Kir2.1. Animals with 
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Larvae 96h 


13 M-DCN 


19 M-DCN 


asymmetrically silenced clones showed lower 
aSD scores than controls with unsilenced 
clones or no clones at all (Fig. 6D and fig. S17). 
Together, these data causally link DCN wiring 
asymmetry to object orientation responses. 

Finally, to test our hypothesis further, we 
asked if generating any asymmetry in visual 
processing is sufficient to override high stripe 
deviation. Among 79 CS flies tested, we se- 
lected the 20 with the highest aSD indices 
(>40), performed monocular deprivation, and 
tested them again. This resulted in a reduction 
of aSD in these flies, as well as in the entire 
population (fig. S18). 


Discussion 


The origins of behavioral individuality are a 
central question in neuroscience, psychology, 
and evolution. The discovery of stable individ- 


6 March 2020 


A Dorsal Cluster Neuron (DCN) correlation anatomy - behavior (developmental manipulations, shibirets) 
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ual traits in nonhuman vertebrates (16) and 
invertebrates facilitated research on behavy- 
ioral variation (JO, J4) and offered both genetic 
(11, 12, 36) and neuromodulatory (II, 15, 36) 
explanations for behavioral idiosyncrasies. Here 
we establish a link between variability in the 
development of the brain and the emergence 
of individuality of animal behavior. Our work 
shows that intrinsically stochastic mechanisms 
of brain wiring give rise to intraindividual var- 
jation of left-right asymmetry in the innerva- 
tion of the fly visual areas, which explains the 
individuality of behavioral differences in object 
responses. The amenability of the relatively 
complex Drosophila brain to multiscale anal- 
ysis, from the molecular to the behavioral, at 
single-animal resolution makes it a model for 
understanding the emergence of individual- 
ity at each of these scales. We speculate that 
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similar mechanisms and consequences will hold 
true in other species, including humans. 

Previous work in Drosophila visual behav- 
ioral neuroscience led to the proposal that 
asymmetry in visual information processing 
influences object responses. Where such func- 
tional asymmetry lay and how it might arise 
has, until now, remained unclear. Indepen- 
dently, the study of object responses in motion- 
blind mutants led Heisenberg and colleagues 
to propose a hypothetical contralateral circuit 
dedicated to object responses in the frontal 
visual field (28, 30-32, 35). Our discovery that 
DCN asymmetry drives object orientation re- 
sponses in individuals is an elegant solution 
combining both predictions: a contralateral 
asymmetric visual circuit that regulates object 
orientation in the frontal visual field. Future 
work will reveal the exact physiological conse- 
quences of morphological asymmetry, such as 
whether wiring asymmetry induces timing dif- 
ferences as in auditory navigation (37) or whether 
the absolute differences are simply summed up. 

Our work provides evidence for the genera- 
tion of multiple brain and behavior phenotypes 
from the same genotype via developmental 
stochasticity and noise. This can serve as a 
robustness factor for both the individual and 
the population by increasing the chances of 
survival of any given genome in case of strong 
selection pressure (22). 
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LIQUID CRYSTALS 


Topological structure and dynamics 
of three-dimensional active nematics 


Guillaume Duclos“, Raymond Adkins**, Debarghya Banerjee*“, Matthew S. E. Peterson’, 
Minu Varghese’, Itamar Kolvin’, Arvind Baskaran’, Robert A. Pelcovits®, Thomas R. Powers®°, 
Aparna Baskaran’, Federico Toschi”®, Michael F. Hagan’, Sebastian J. Streichan?, 

Vincenzo Vitelli?, Daniel A. Beller’°+, Zvonimir Dogic’?+ 


Topological structures are effective descriptors of the nonequilibrium dynamics of diverse many-body 
systems. For example, motile, point-like topological defects capture the salient features of 
two-dimensional active liquid crystals composed of energy-consuming anisotropic units. We dispersed 
force-generating microtubule bundles in a passive colloidal liquid crystal to form a three-dimensional 
active nematic. Light-sheet microscopy revealed the temporal evolution of the millimeter-scale 
structure of these active nematics with single-bundle resolution. The primary topological excitations 
are extended, charge-neutral disclination loops that undergo complex dynamics and recombination 
events. Our work suggests a framework for analyzing the nonequilibrium dynamics of bulk 
anisotropic systems as diverse as driven complex fluids, active metamaterials, biological tissues, 


and collections of robots or organisms. 


he sinuous change in the orientation of 
birds flocking is a common but startling 
sight. Even if one can track the orienta- 

tion of each bird, making sense of such 
large datasets is difficult. Similar chal- 
lenges arise in disparate contexts from mag- 
netohydrodynamics (7) to turbulent cultures 
of elongated cells (2), where oriented fields 
coupled to velocity undergo complex dynam- 
ics. To make progress with such extensive three- 
dimensional (3D) data, it is useful to identify 
effective degrees of freedom that allow a coarse- 
grained description of the collective nonequi- 
librium phenomena. Promising candidates 
are singular field configurations locally pro- 
tected by topological rules (3-9). Examples 
of such singularities in 2D are the topolog- 
ical defects that appear at the north and south 
poles when covering the Earth’s surface with 
parallel lines of longitude or latitude. These 
point defects are characterized by the winding 
number of the corresponding orientation field. 
The quintessential systems with orienta- 
tional order are nematic liquid crystals, which 
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are fluids composed of anisotropic molecules. 
In equilibrium, nematics tend to minimize 
energy by uniformly aligning their anisotropic 
constituents, which annihilates topological de- 
fects. By contrast, in active nematic materials, 
which are internally driven away from equi- 
librium, the continual injection of energy de- 
stabilizes defect-free alignment (J0, 11). The 
resulting chaotic dynamics are effectively rep- 
resented in 2D by point-like topological de- 
fects that behave as self-propelled particles 
(12-16). The defect-driven dynamics of 2D 
active nematics have been observed in many 
systems ranging from millimeter-sized shaken 
granular rods and micrometer-sized motile 
biological cells to nanoscale motor-driven bio- 
logical filaments (77-23). Several obstacles have 
hindered generalizing topological dynamics 
of active nematics to 3D. The higher dimen- 
sionality expands the space of possible defect 
configurations. Discriminating between differ- 
ent defect types requires measurement of the 
spatiotemporal evolution of the director field 
on macroscopic scales using materials that 
can be rendered active away from surfaces. 
The 3D active nematics that we assembled 
are based on microtubules and kinesin mo- 
lecular motors. In the presence of a depleting 
agent, these components assemble into iso- 
tropic active fluids that exhibit persistent 
spontaneous flows (77). Replacing a broadly 
acting depletant with a specific microtubule 
cross-linker, PRCI-NS, enabled assembly of a 
composite mixture of low-density extensile 
microtubule bundles (~0.1% volume frac- 
tion) and a passive colloidal nematic based 
on filamentous viruses (Fig. 1A), a strategy 
that is similar to work on the living liquid 


6 March 2020 


crystal (21). Adenosine 5’-triphosphate (ATP)- 
fueled stepping of kinesin motors generates 
microtubule bundle extension and active 
stresses that drive the chaotic dynamics of 
the entire system (movie S1). Birefringence of 
the composite material indicates local nematic 
order (Fig. 1B), in contrast to active fluids lack- 
ing the passive liquid crystal component. 

Elucidating the spatial structure of a 3D 
active nematic requires measurement of the 
nematic director field on scales from micro- 
meters to millimeters. Furthermore, uncov- 
ering its dynamics requires acquisition of 
the director field with high temporal resolu- 
tion. To overcome these constraints, we used 
a multiview light sheet microscope (Fig. 1C) 
(24). The spatiotemporal evolution of the ne- 
matic director field n(wv,y,z,t) was extracted 
from a stack of fluorescent images using the 
structure tensor method. Spatial gradients 
of the director field identified regions with 
large elastic distortions (Fig. 1D and movie 
$2). Three-dimensional reconstruction of such 
maps revealed that large elastic distortions 
mainly formed curvilinear structures, which 
could either be isolated loops or belong to a 
complex network of system-spanning lines 
(Fig. 1E and movie S83). These curvilinear dis- 
tortions are topological disclination lines 
characteristic of 3D nematics. Similar struc- 
tures were observed in numerical simulations 
of 3D active nematic dynamics using either a 
hybrid lattice Boltzmann method or a finite 
difference Stokes solver numerical approach 
(Fig. IF) (25, 26). 

Reducing the ATP concentration slowed 
down the chaotic flows, which revealed the 
temporal dynamics of the nematic director 
field. In turn, this identified the basic events 
governing the dynamics of disclination lines 
(movie S4). We focused on characterizing the 
closed loop disclinations because they are the 
objects seen to arise or annihilate in the bulk. 
Isolated loops nucleated and grew from un- 
distorted, uniformly aligned regions (Fig. 2A, 
figs. S1 and S2, and movie S5). Likewise, loops 
also contracted and self-annihilated, leaving 
behind a uniform region (Fig. 2B, figs. S1 and 
$2, and movie S6). Furthermore, expand- 
ing loops frequently encountered and subse- 
quently merged with the system-spanning 
network of distortion lines, whereas the dis- 
tortion lines in the network self-intersected 
and reconnected to emit a new isolated loop 
(Fig. 2, C and D; figs. S1 and S2; and movies S7 
and S8). 

Topological constraints require that topo- 
logical defects can only be created in sets 
that are, collectively, topologically neutral. 
Point-like defects in 2D active nematics thus 
always nucleate as pairs of opposite winding 
number (73). In 3D active nematics, an iso- 
lated disclination loop as a whole has two 
topological possibilities: It can either carry a 
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Fig. 1. Assembling 3D active nematics and imaging their director field. (A) Schematic of the 3D active 
nematic system: active stress—generating extensile microtubule bundles are dispersed in a passive colloidal 
liquid crystal. (B) Active 3D nematic imaged with widefield fluorescent microscopy (left) and polarized 
microscopy (right). Birefringence indicates local nematic order. (C) Multiview light sheet microscopy allows 
for 3D imaging of millimeter-sized samples with single-bundle resolution. (D) Left: A 2D slice of fluorescent 
microtubule bundles with highlighted elastic distortions. Right: Corresponding elastic distortion energy map, with 
an overlaid nematic director field (red). (E) Three-dimensional elastic distortion map revealing the presence 
of curvilinear rather than point-like singularities. An entangled network of lines coexists with isolated loops. 
(F) Hybrid lattice Boltzmann simulations yield a similar structure of 3D active nematics. All experimental 
samples consist of passive fd viruses at 25 mg/mL and microtubules at 1.33 mg/mL. 


Fig. 2. Dynamics of experimentally A v ~ 
observed disclination loops. (A) Loop Nucleation 
nucleation from a defect-free region. 
(B) Loop self-annihilation leaves @) 
behind a defect-free nematic. Sf 
(C) Disclination line self-intersects, q 
reconnects, and emits a loop. Ww i 
(D) Disclination loop intersects, B ne WH . _ S, =~ WW ee 
reconnects, and merges with a Annihilation yo are Owe ar SS 
disclination line. Each bounding ( ir \ ‘ ‘oe 4 IP X \x 
box is 30 x 30 x 38 um. The time Koy, Xd q 5@ \ 
interval between two pictures is 12 s. a. oats < ‘Trost Sd 1 KS 
~ ma % A] < ic i 
+ \ i i 7 } 
Cc 
Split re ’ 
c\ Ss 
D 7 ¥ 
Merge | t m~ 


Duclos et al., Science 367, 1120-1124 (2020) 6 March 2020 20f5 


RESEARCH | REPORT 


monopole charge or be topologically neutral, 
depending on its director winding structure. 
Because charged topological loops can only 
appear in pairs, nucleation of isolated loops 
as observed in our system implies their to- 
pological neutrality. 


Fig. 3. Structure of disclinations lines, wedge- 
twist, and pure-twist loops. (A) Disclination line 
where a local +1/2 wedge winding continuously 
transforms into a -1/2 wedge through an interme- 
diate twist winding. The director field winds by x 
about the rotation vector @ (black arrows), which 
makes angle B with the tangent t (orange arrow) and 
is orthogonal to the director field everywhere in 
each slice. For +1/2 wedge windings, B = 0 and x. 

B = x/2 indicates local twist winding. Reference 
director ng (brown) is held fixed. Color map indicates 
angle B. (B) Wedge-twist loop where local winding as 
reflected by angle B varies along the loop. Q is 
spatially uniform and forms an angle y = 2/2 with 
the loop's normal, N. The winding in the four 
illustrated planes corresponds to the profiles of the 
same colors shown in (A), with dashed edges of 
squares aligned to match the local director field. 
Double-headed brown arrows indicate Nout, the 
director just outside the loop. (C) Pure-twist loop, 
with @ both uniformly parallel to loop normal N 

(y = 0) and perpendicular to the tangent vector. 
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To establish that the closed-loop distor- 
tions are nematic disclination loops with no 
net charge, we characterized their topolog- 
ical structure. In 2D nematics, point-like dis- 
clination defects are characterized by the 
winding number or topological charge(s). The 
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lowest-energy disclinations have s = +1/2, 
which corresponds to a x rotation of the di- 
rector field in the same sense or the oppo- 
site sense, respectively, as the traversal of 
any closed path encircling only the defect 
of interest. In 3D nematics, point-like defects 
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Fig. 4. Structure of disclination loops in experiments and theory. (A) Two 
orthogonal views of an experimental wedge-twist loop overlaid onto a fluo- 
rescent image of the microtubules. The nematic director is shown in red. 

(B and E) Structure of wedge-twist disclination loops in experiments and sim- 
ulation. (© and F) Structure of pure-twist disclination loops from experiment 
and simulation. Panels show the director field's winding in the corresponding 
cross-sections on the experimental loops. (D) Distribution of loop types 
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extracted from experiment (N = 268) and hybrid lattice Boltzmann simulations 
(N = 94). |cos(y)| = 0 for wedge-twist loops and 1 for pure-twist loops. 
Distributions of standard deviations of |cos(y)| are shown in fig. S3. The count 
of simulated loops includes analysis of some loops at multiple time points 
because we did not track loop identity in the complex flow dynamics. Coloring 
of loops indicates the angle B. Scales and bounding boxes for the loops are 
shown in fig. S4. 
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from 2D systems are generalized to discli- 
nation lines, where the director similarly 
has a x winding, affording a broader variety 
of director configurations. We define t to be 
the disclination line’s local tangent unit 
vector. The director field winds by x about 
a direction specified by ®, the rotation vec- 
tor, which can make an arbitrary angle B 
with t (27). If @ points antiparallel or par- 
allel to t, then the local director field rotates 
in the plane orthogonal to t, assuming the 
disclination profiles familiar from 2D nem- 
atics. These configurations in which f£ is 
equal to 0 or z are said to have a local wedge 
winding (Fig. 3A). If 2 is perpendicular to t, 
then f = n/2 and the director forms a spa- 
tially varying angle away from the orthog- 
onal plane, locally creating what is called a 
twist winding. Because Q may point in any 
direction relative to t, both @ and B can 
vary continuously along a disclination line 
(movie S9). 

For disclination lines forming loops, Q 
can vary continuously providing it returns 
to its original orientation upon closure, lead- 
ing to a broad range of possible winding 
variations. A family of loops of particular 
relevance to 3D active nematics is character- 
ized by a spatially uniform Q, interpolating 
between two emblematic geometries: wedge- 
twist and pure-twist loops. In the wedge- 
twist loop, 2 makes an angle y = 1/2 with the 
loop normal N (Fig. 3B). As the disclination’s 
tangent t rotates by 2x upon traveling around 
the loop, the angle f varies from 0 (+1/2 wedge) 
to n/2 (twist), to x (-1/2 wedge), then back 
to n/2, and finally returning to 0 (movie S9) 
(27, 28). The pure-twist loop has 2 uniformly 
parallel to N, so y = 0 and Q is perpendic- 
ular to t (B = 1/2, twist profile) at all points 
on the loop (Fig. 3C) (27, 29). In this family 
of loops, the director just outside the loop, 
Nout, is also uniform. The lack of winding of 
both @ and Nout implies that both wedge- 
twist and pure-twist loops are topologically 
neutral (30, 31). 

Experimental measurements of the director 
field allowed us to fully characterize the topo- 
logical structure of the disclination loops 
(Fig. 4). Analysis of the director field indicated 
that the distortion lines and loops have the 1 
winding indicative of disclinations (Fig. 1, E 
and F), with continuous variation of B, which 
indicates local winding. Furthermore, most of 
the analyzed loops were well approximated by 
the family of curves where 2 and Noy varied 
little along the loop circumferences. Categoriz- 
ing loops according to their y values revealed 
that the entire continuous family from wedge- 
twist (Fig. 4, A and B) to pure-twist (Fig. 4C) 
was represented, with the latter being more 
prevalent (Fig. 4D). Structural analysis re- 
vealed topological neutrality, as all 268 ex- 
perimental loops and all 94 loops extracted 
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Fig. 5. Nucleation mechanism of wedge-twist and pure-twist loops. (A) Nucleation and growth of a 
wedge-twist disclination loop through a self-amplifying bend distortion. Purple rods represent the 2D director 
field through the local +1/2 wedge profiles. (B) Schematic of a wedge-twist loop and the director field in 
the plane that intersects +1/2 wedge profiles. (©) Pure-twist disclination loop nucleates and grows from 

a local twist distortion (movie S10). Black arrows indicate the local buildup of the twist distortion. 


Insert shows the top view of a growing twist disclinati 


the director field in the loop's plane. 


from hybrid lattice Boltzmann simulations 
carried no charge. This demonstrates that 
among many possible configurations, topo- 
logically neutral loops are the dominant ex- 
citation mode of 3D active nematics. The 
same class of loop geometries also domi- 
nated the dynamics in our numerical sim- 
ulations of bulk 3D active nematics and in 
confined active nematics (Fig. 4, E and F) 
(25, 26, 32, 33). The phenomenology observed 
is a direct consequence of activity-induced 
flows and is insensitive to backflows induced 
by reactive stresses. This conclusion is sup- 
ported by the agreement of results from the 
mechanical model considered in the hybrid 
lattice Boltzmann method and the purely kin- 
ematic Stokes method. 

In 2D active nematics, self-amplifying bend 
distortions give rise to the nucleation of a 
pair of topological defects of opposite charge 
(12-18). Nucleation of isolated, topologically 
neutral wedge-twist loops are the 3D analog of 
the 2D defect-creation process. Specifically, a 
cross-section through the +1/2 and -1/2 wedge 
profiles recalls unbinding of a pair of point 
disclinations in 2D (Fig. 5, A and B). The +1/2 
wedge profile typically appears on the side of 
the growing bend distortion, oriented away 
from the -1/2 wedge profile. Similarly, wedge- 
twist loops with the +1/2 wedge profile ori- 
ented inward toward the -1/2 wedge are 
driven to shrink by active and passive stresses. 
Unlike in 2D active nematics, after nucleation, 
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on loop. (D) Schematic of a pure-twist loop and 


the wedge profiles remain bound to each other 
through a disclination loop that includes 
points with a local twist winding. It is possible 
that some analyzed pure-twist loops have 
evolved from wedge-twist loops by continu- 
ous deformation of local winding character. 
However, both simulations and experiments 
showed cases of loop nucleation in nearly 
pure-twist (y ~ 0) geometries from previ- 
ously defect-free regions. Local active nematic 
stresses alone are not expected to drive growth 
of a pure-twist loop (Fig. 5D). One possibility 
is that long-range hydrodynamic flows build 
up twist distortions that locally relax through 
creation of a pure-twist loop (Fig. 5C and 
movie S10). 

By coupling a flow field to an orientational 
order parameter with curvilinear topological 
defects, 3D active nematics display dynamics 
even more complex than the chaotic flows of 
2D active systems. Combined with emerging 
theoretical work (32, 33), the experimental 
model system described herein offers a plat- 
form with which to investigate the role of to- 
pology, dimensionality, and material order 
in the chaotic internally driven flows of ac- 
tive soft matter. Furthermore, the use of a 
multiview light sheet imaging technique dem- 
onstrates its potential to unravel dynamical 
processes in diverse nonequilibrium soft ma- 
terials, such as relaxation of nematic liquid 
crystals upon a quench or their deformation 


under external shear flow (3, 34). 
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ELECTRON MICROSCOPY 


Single-atom vibrational spectroscopy in the scanning 
transmission electron microscope 


F. S. Hage!, G. Radtke@*, D. M. Kepaptsoglou’’, M. Lazzeri”, Q. M. Ramasse'** 


Single-atom impurities and other atomic-scale defects can notably alter the local vibrational responses 
of solids and, ultimately, their macroscopic properties. Using high-resolution electron energy-loss 
spectroscopy in the electron microscope, we show that a single substitutional silicon impurity in 
graphene induces a characteristic, localized modification of the vibrational response. Extensive ab initio 
calculations reveal that the measured spectroscopic signature arises from defect-induced pseudo- 
localized phonon modes—that is, resonant states resulting from the hybridization of the defect modes 
and the bulk continuum—with energies that can be directly matched to the experiments. This finding 
realizes the promise of vibrational spectroscopy in the electron microscope with single-atom sensitivity 
and has broad implications across the fields of physics, chemistry, and materials science. 


hanges in the normal mode frequencies 

of dynamical systems that arise from 

the presence of impurities have been 

studied since the 19th century, which has 

resulted in the set of classical theorems 
now referred to as the Rayleigh theorems (J, 2). 
However, the modern theory of defect modes 
in crystals was established in the 1940s with 
the pioneering work of Lifschitz (3). Many 
studies followed, mainly based on optical spec- 
troscopies (4), which identified two types of 
nontrivial defect-induced modes known as lo- 
calized and resonant modes. Resonant modes 
are also called quasi- or pseudo-localized modes 
because, despite being spatially extended, they 
involve a large-amplitude vibration of the im- 
purity itself. Defect modes can control ma- 
terials’ properties such as electric and heat 
transport or, more generally, processes that 
are affected by the scattering of electrons or 
phonons. This can be exploited, for example, 
to suppress heat propagation in thermoelec- 
trics using rattler modes (5), to tune the super- 
conductivity in two-dimensional films (6), or to 
affect the optoelectronic properties of conduct- 
ing polymers (7). Although the existence of an 
atomically localized spectroscopic signature of 
single-atom defects has long been discussed (8), 
conventional vibrational spectroscopies typi- 
cally average information over much larger 
length scales. 

Vibrational electron energy-loss spectroscopy 
(EELS) in the scanning transmission electron 
microscope (STEM) has recently emerged as 
a powerful means of probing the vibrational 
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response of materials at a spatial resolution 
that is superior to that of other experimental 
techniques (9, 10). Tip-enhanced Raman spec- 
troscopy (TERS) (77) and inelastic electron 
tunneling spectroscopy (IETS) (72, 13) provide 
high spatial and energy resolution alternatives, 
but they are strictly limited to surface exper- 
iments and, therefore, present challenges for a 
range of applications. Vibrational STEM-EELS, 
on the other hand, takes advantage of versatile 
probe-forming optics to offer ground-breaking 
capabilities: nanometer-scale thermometry 
(14), mapping of bulk and surface-phonon- 
polariton modes (15), establishing phonon dis- 
persion diagrams from nano-objects (J6), and 
site-specific isotopic labeling in molecular 
aggregates (17). These reports highlight the 
complementarity of STEM-EELS with con- 
ventional vibrational spectroscopies whose 
energy resolutions remain unmatched. How- 
ever, the ultimate promise of vibrational STEM- 
EELS is the ability to reach the single-atom 
or molecular level, in the same way that modern 
microscopes have enabled electronic structure 
analysis (78), plasmonic (19) and UV-optical re- 
sponse fingerprinting (20), and energy-dispersive 
x-ray spectroscopy (21) from single atoms. Atom- 
ically resolved phonon maps of bulk systems 
are preliminary steps in this direction (22). 

In this work, we use STEM-EELS to mea- 
sure the localized vibrational signature of a 
single trivalent substitutional Si atom in single- 
layer graphene (Si@Gr). From ab initio simu- 
lations, we attribute the measured atomic-scale 
spectroscopic response to scattering by pseudo- 
localized vibrational modes arising from a 
resonance between the Si impurity-specific 
modes and the bulk continuum. 

Figure 1A illustrates how electron beam 
deflectors are adjusted to displace the EEL 
spectrometer entrance aperture by 69 mrad 
(or an 8.87-A* momentum transfer) with re- 
spect to the bright field (BF) disc so that 
these no longer overlap. Further details of the 
experimental geometry are provided in the 
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supplementary materials (fig. S1). Compared 
with a conventional on-axis geometry, where the 
EELS aperture is centered on the BF disc, this 
off-axis or dark-field EELS geometry marked- 
ly suppresses the relative contributions of elec- 
trons having undergone elastic and delocalized 
phonon scattering, favoring instead highly 
localized impact phonon scattering (23). This 
approach makes it possible to record atomic- 
resolution phonon scattering maps of nanometer- 
thick flakes of hexagonal boron nitride (22) 
or of single-layer graphene (fig. S2), where 
the off-axis geometry is key because the on-axis 
EELS phonon response of graphene is vanish- 
ingly small (24). Note that the large beam con- 
vergence that is necessary for an atomic-sized 
probe results in spectral integration over a 
range of momentum transfer in the sample 
plane. To achieve a signal-to-noise ratio suf- 
ficient for resolving the phonon loss spectrum 
fine structure, the electron beam is scanned 
repeatedly over a small window, tightly de- 
fined around the impurity of interest, while 
the spectrum intensity is accumulated (25). 

Figure 1B shows a dark-field EEL spectrum 
from a single Si atom impurity in graphene 
(labeled Si) alongside that acquired from a 
comparably sized region of pristine graphene 
dabeled C), located only a few atoms away 
from the Si impurity. The relative positions of 
the two scanned regions are indicated by red 
(Si) and blue (C) boxes on the high-angle an- 
nular dark-field (HAADF) image in Fig. 1C. A 
close-up of the probed Si atom (Fig. 1D) and 
the corresponding fine structure of the Si Lys 
ionization edge (fig. S3C) confirm that the 
brighter-contrast Si atom is trivalently substi- 
tuted into the graphene lattice. Asymmetric 
annular dark-field (aADF, thus denoted be- 
cause of the off-axis geometry) movies were 
recorded during spectrum acquisition to mon- 
itor possible beam-induced structure modifi- 
cations, while ensuring that the probed atom 
remained centered within the scanned region. 
Averaged aADF movies are shown as insets in 
Fig. 1B, with individual frames shown in the 
supplementary materials. 

The Si and C spectra in Fig. 1B are normal- 
ized to the maximum of their respective zero- 
loss peaks (ZLPs). As a result, the tails of the 
ZLPs closely overlap immediately before the 
first observable loss features, which allows for 
a straightforward visual comparison of rela- 
tive changes in energy loss caused by inelastic 
scattering by phonons. Any change in spec- 
trum intensity above the coinciding ZLP tails 
should be representative of differences in rela- 
tive phonon scattering probability. The fine 
structure in the phonon energy range of the 
two recorded spectra is strikingly different. 
Although the C spectrum is consistent with 
that of nondoped bulk graphene (24), the Si 
spectrum comprises phonon loss features at 
different energies. 
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Fig. 1. Experimental geometry and vibrational STEM-EEL spectrum of a Si 
impurity in graphene. (A) Beam deflectors shift the BF disc away from the EEL 
spectrometer entrance aperture (Ap.) in the diffraction plane. (B) Normalized 
vibrational EEL spectra of a substitutional Si impurity and of defect-free 
graphene. Insets show aADF images of the repeatedly scanned sample regions. 
Smoothed spectra (thin lines) are superimposed on the raw data (shading 
around the lines). (©) HAADF overview of the experimental region. Red and blue 
boxes indicate the positions of the sub-scan regions from which the Si and C 


Figure 1E shows, in greater detail, the pho- 
non loss region of the spectra. The C spec- 
trum exhibits two distinct loss peaks at 85 meV 
(685 cm“) and 170 meV (1371 cm”). Following 
(24), we attribute these peaks to scatter- 
ing by transverse (T) or longitudinal (L), 
acoustic (A) or optical (O) modes in graphene, 
respectively (the graphene-phonon dispersion 
diagram is presented for reference in fig. S10). 
Spectral contributions of out-of-plane phonon 
modes are expected to be negligible, as the in- 
cident electron beam is normal to the graphene 
plane. Despite stemming from a position only 
a few atoms away, the Si spectrum shows a 
remarkably different phonon fine structure 
comprising a prominent loss peak at about 
55 meV (443 cm’) and weaker structures at 
125 and 150 meV (1008 and 1209 cm™'). To 
enhance the differences between the spectra, 
we subtracted the C from the Si spectrum, and 
the resulting difference spectrum is shown in 
Fig. 1, E and F. This has the additional benefit 
of effectively subtracting the elastic scattering 
ZLP tail (making the reasonable assumption 
that the tail contribution, before any expected 
loss contribution, is similar between spectra) 
without possible errors associated with com- 
mon background removal techniques, as dis- 
cussed in the supplementary materials (fig. 
S4). Thus, the difference can be interpreted as 
a relative change in phonon scattering prob- 
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ability induced by the presence of the single 
Si atom impurity. Virtually identical results 
(detailed in the supplementary materials) were 
obtained from complementary measurements 
carried out in a different area of the sample. 
These experimental results lead to the remark- 
able conclusion that the single Si atom impurity 
in graphene possesses a characteristic vibra- 
tional signature localized at the atomic scale. 

To gain insights into the physics associated 
with these results, we have calculated, within 
the framework of density functional theory 
(DFT) (26) and using periodic boundary con- 
ditions, the vibrational spectrum of a large 
96x96 supercell of graphene (96 unit cells by 
96 unit cells) containing one substitutional 
Si atom. The structure of the defect and com- 
putational details are presented in the sup- 
plementary materials. As discussed therein, 
the important features observed in the vibra- 
tional EEL spectra of graphene can be safely 
interpreted in terms of the phonon density 
of states (DOS) of the bulk. The local be- 
havior of the DOS can be quantified by the 
projected phonon DOS (PPDOS), defined as 
nX(@) = YS les)5(o — @,), Where « denotes 
a specific atom, w, and e, are the phonon an- 
gular frequency and normalized polarization, 
and the sum is carried over all the phonon 
modes, v, of the supercell. Because the momen- 


tum transfer occurs predominantly in the 
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(F) Comparison of the calculated differential PPDOS (broadened to match the 
experimental resolution) and the experimental difference spectrum. The blue and 


energy ranges where the contributions of the Si 


impurity and its three nearest neighbors, or that of bulk graphene, are 


, arbitrary units. 


plane perpendicular to the electron beam 
trajectory in our experiments, only the com- 
ponents of the phonon polarization that are 
parallel to the graphene plane are relevant. A 
tentative comparison to the experimental dif- 
ference spectrum is then provided by combin- 
ing the PPDOS projected on the Si atom, ns 
the PPDOS projected on its three C neighbors, 
n“; and the bulk phonon DOS per atom, bulk, 
n(@) = [n(@) + 8n“(@) — 4nK(@)] /4, This 
differential PPDOS reflects the experimen- 
tal spectrum averaging over the scanning 
window, which is expected to include con- 
tributions from the impurity’s neighboring 
C atoms. The resulting differential PPDOS is 
shown in Fig. 1F, after broadening to match 
the experimental resolution. It predicts all the 
main features of the experimental difference 
spectrum, including a single peak at ~55 meV 
(443 em"), two overlapping peaks at 125 
and 150 meV (1008 and 1209 cm”), and dips 
centered around 100 and 180 meV (807 and 
1452 cm”). 

The physical origin of these spectral fea- 
tures can be understood by considering the in- 
dividual in-plane PPDOS employed to construct 
the differential PPDOS, 7, and the PPDOS of 
C atoms located at increasing distances away 
from the Si impurity (Fig. 2A). The Si PPDOS 
is dominated by an intense peak at 55 meV, 
closely matching the low-energy experimental 
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Fig. 2. Localization of the vibrational signal. (A) Calculated in-plane component of the phonon DOS 


projected on the Si and C atoms at increasing distances from the impurity. Overlaid light gray lines show the 


bulk graphene phonon DOS per atom, fully recovered from atom 6. The curves are vertically shifted and 


smeared by a 2-meV FWHM Lorentzian for clarity. (B) Sketch of the position of the C atoms, labeled 1 to 6, 
and Si impurity (red sphere). (C) Background-subtracted experimental spectra acquired at equivalent atomic 


positions. Smoothed (black lines) and raw (gray dots) data are overlaid. 
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Si component 


feature seen in Fig. 1E. This peak is followed 
by a broad band with weaker structures at 105, 
127, and 155 meV (847, 1024, and 1250 cm”). 

The absence of intense features in the bulk 
graphene DOS at 55 meV implies that the 
corresponding modes should possess a degree 
of localization. By inspecting the PPDOS of 
neighboring C atoms in Fig. 2, it is evident 
that, although the first two C neighbor shells 
coordinating the impurity still display traces 
of the 55 meV peak, its contribution is weak. 
The PPDOS of subsequent neighbors rapidly 
tends toward the bulk signature, which is fully 
retrieved after six shells. Corresponding atom- 
ically resolved experimental spectra in Fig. 2C, 
from a full-spectrum image over equivalent 
C neighbor positions (fig. S8), exhibit an iden- 
tical trend: The EELS signal reproduces the 
main features observed in the in-plane PPDOS. 

It is instructive to consider a calculation 
performed on a smaller 13-atom fragment of 
C3, Symmetry centered on the impurity (fig. 
$9), decoupled from the supercell by artifi- 
cially setting the interatomic force constants 
linking the fragment to the rest of the 96x96 
supercell to zero. The fragment displays two 
modes with F symmetry at 52 and 124 meV 
(419 and 1000 cm”), involving large in-plane dis- 
placements of the Si atom either in phase (mode 
A) or out of phase (mode B) with the neigh- 
boring C atoms (Fig. 3). The resonances in the 
full Si@Gr system, simulated by the 96x96 
supercell, can thus be interpreted as a hybrid- 
ization of these local impurity modes with the 
vibrational continuum of the graphene bulk. 

The associated atomic displacements, in- 
cluding those arising from the in-plane vibra- 
tion of the Si atom, do not decay far from the 
defect; the full system presents a delocalized 
continuum, a concept quantified with the in- 
verse participation ratio analysis shown in 
the supplementary materials. However, these 
delocalized phonon modes possess an enhanced 
component atomically localized on the im- 
purity. The power of EELS is the technique’s 
ability to probe this quasi-localization, there- 
by revealing the paradoxical nature of defect- 
induced resonant modes. It is also notable 
that the experimentally measured ~30 meV 
(242 cm~) full-width at half-maximum (FWHM) 
of the impurity peak at ~55 meV (Fig. 1F) 
closely matches the intrinsic theory-predicted 
width of the resonant mode (Fig. 3A). The 
experimental energy resolution is therefore 
not limiting, and the EEL spectra faithfully 
capture the fine structure of the Si@Gr sys- 
tem’s vibrational response. 

Localized and resonant modes arising from 
point defects have been widely discussed (8). 
The former are characterized by frequencies 
lying out of the continuum of the unperturbed 
crystal and atomic amplitudes dying off faster- 
than-exponentially with increasing distance 
from the defect (27). By contrast, the latter 
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occur at frequencies lying within the allowed 
bands of the host. The recognition of reso- 
nant modes was delayed by their peculiar 
characteristics, where the vibrational ampli- 
tude does not vanish far from the defect, ex- 
tending instead over the entire crystal (28). 
Furthermore, experimental observations of 
these effects have, thus far, been limited to 
indirect fingerprints, often at the macroscopic 
scale. Volgmann e¢ al. (13) used scanning probe 
microscope (SPM)-IETS to detect a local energy- 
dependent increase in phonon DOS on a Ag (100) 
surface, which they attributed to a substitu- 
tional Cu atom. However, the surface nature 
of these experiments and the lack of more 
direct visualization means precluded an un- 
ambiguous interpretation. 

In contrast, the ability demonstrated in this 
work to directly measure, at the atomic scale, 
the localized component of the vibrational 
signature of a single impurity atom within a 
solid, and to match the observed spectral fine 
structure to theoretically predicted modes, 
realizes the potential of phonon spectroscopy 
in the STEM. The STEM-EELS technique, char- 
acterized by single-atom defect sensitivity com- 
bined with isotope selectivity (17) and the ability 
to operate at cryogenic temperatures (29), now 
enables potential experiments where a single 
functionalizing isotope is fingerprinted at the 
atomic scale through its vibrational signature. 
The approach should be applicable to three- 
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dimensional structures, although challenges 
will arise from the complexity of the compu- 
tational work necessary to inform these ex- 
periments. Nevertheless, this opens up a path 
to further applications in solid-state science, 
where the electron beam of the STEM can be 
used to assemble functional devices atom by 
atom (30) and to spectroscopically probe the 
resulting lattice dynamics and their coupling 
with other quasiparticles. 
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QUANTUM SIMULATION 


A scalable realization of local U(1) gauge invariance 


in cold atomic mixtures 


Alexander Mil’, Torsten V. Zache”, Apoorva Hegde’, Andy Xia’, Rohit P. Bhatt’, Markus K. Oberthaler’, 
Philipp Hauke’2", Jiirgen Berges”, Fred Jendrzejewski? 


In the fundamental laws of physics, gauge fields mediate the interaction between charged particles. 
An example is the quantum theory of electrons interacting with the electromagnetic field, based 
on U(1) gauge symmetry. Solving such gauge theories is in general a hard problem for classical 
computational techniques. Although quantum computers suggest a way forward, large-scale digital 
quantum devices for complex simulations are difficult to build. We propose a scalable analog 
quantum simulator of a U(1) gauge theory in one spatial dimension. Using interspecies spin-changing 
collisions in an atomic mixture, we achieve gauge-invariant interactions between matter and 

gauge fields with spin- and species-independent trapping potentials. We experimentally realize 

the elementary building block as a key step toward a platform for quantum simulations of 


continuous gauge theories. 


auge symmetries are a cornerstone of 

our fundamental description of quan- 

tum physics as encoded in the standard 

model of particle physics. The presence 

of a gauge symmetry implies a concerted 
dynamics of matter and gauge fields that is 
subject to local symmetry constraints at each 
point in space and time (/). To uncover the 
complex dynamical properties of such highly 
constrained quantum many-body systems, enor- 
mous computational resources are required. 
This difficulty is stimulating great efforts to 
quantum simulate these systems, i.e., to solve 
their dynamics using highly controlled exper- 
imental setups with synthetic quantum systems 
(2-4). First experimental breakthroughs have 
used quantum-computer algorithms that imple- 
ment gauge invariance exactly, but which are 
either limited to one spatial dimension (5, 6), 
restrict the dynamics of the gauge fields (7, 8), 
or require classical preprocessing resources 
that scale exponentially with system size (9). 
Recently, the dynamics of a discrete Z) gauge 
theory in a minimal model has been realized 
based on Floquet engineering (10-12). Despite 
these advances, the faithful realization of large- 
scale quantum simulators describing the con- 
tinuum behavior of gauge theories remains 
highly challenging. 

Our aim is the development of a scalable 
and highly tunable platform for a continuous 
U(1) gauge theory, such as realized in quantum 
electrodynamics. In the past years, ultracold 
atoms have become a well-established system 
for mimicking condensed-matter models with 
static electric and magnetic fields (13) and 
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even dynamical background fields for moving 
particles (14-16). These systems possess global 
U() symmetries related to the conservation 
of total magnetization and atom number (7). 
However, a gauge theory is based on a local 


A gauge field 


n-1 n 


symmetry, which we enforce here through spin- 
changing collisions in atomic mixtures. This 
promising mechanism to protect gauge invar- 
iance has been put forward in various proposals 
(18-21) but not yet demonstrated experimen- 
tally. We demonstrate the engineering of an 
elementary building block in a mixture of bo- 
sonic atoms, demonstrate its high tunability, 
and verify its faithful representation of the de- 
sired model. 

We further propose an extended implemen- 
tation scheme in an optical lattice, where each 
lattice well constitutes an elementary building 
block that contains both matter and gauge 
fields. Repetitions of this elementary unit can 
be connected using Raman-assisted tunneling 
(22). Gauge and matter fields are spatially ar- 
ranged in such a way that the spin-changing 
collisions occur within single-lattice wells, in 
contrast to previous proposals (18-21) where 
the gauge and matter fields were spatially 
separated and spin-changing collisions had to 
be accompanied by hopping across different 
sites of the optical lattice. 

We specify our proposal for a one-dimensional 
gauge theory on a spatial lattice, as visualized 


matter field 


n+4 n+2 


B building block 


Fig. 1. Engineering a gauge theory. (A) Structure of a lattice gauge theory. Matter fields reside on sites 
and gauge fields on the links in between. (B) Proposed implementation of the extended system. Individual 
building blocks consist of long spins (representing gauge fields) and matter states, which are confined within 
the same well and whose interaction constitutes a local U(1) symmetry. An array of building blocks in 

an optical lattice is connected via Raman-assisted tunneling. (C) Experimental realization of the elementary 
building block with bosonic gauge (sodium) and matter (lithium) fields. The gauge-invariant interaction is 


realized by heteronuclear spin-changing collisions. 
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Fig. 2. Tunability of the initial conditions. The 
normalized spin z-component L,/L of “7Na atoms 
as a function of the preparation pulse length, which 
shows that the gauge field can be tuned experi- 
mentally over the entire possible range. Simulta- 
neously, the particle number Np /N of 7Li is 

kept in the vacuum state. The inset shows a 
sketch of the experimental protocol used for tuning 
the initial conditions. 


in Fig. 1A. Charged matter fields reside on the 
lattice sites n, with gauge fields on the links 
in between the sites (23). We consider two- 
component matter fields labeled “p” and “v”, 
which are described by the operators (bs, Baw 
To realize the gauge fields with the atomic 
system, we use the quantum link formula- 
tion (24-26), where the gauge fields are re- 
placed by quantum mechanical spins L,, — 
(Lew : Das Leas labeling link operators by the 
index of the site to the left. In this formulation, 
the spin g-component Lag can be identified 
with a discrete “electric” field. We recover the 
continuous gauge fields of the original quan- 
tum field theory in a controlled way by work- 
ing in the limit of long spins (20). 

Physically, this system of charged matter 
and gauge fields can be realized in a mixture of 
two atomic Bose-Einstein condensates (BECs) 
with two internal components each (in our ex- 
periment, we use ‘Li and 7°Na). An extended 
system can be obtained by use of an optical 
lattice. In our scheme, we abandon the one-to- 
one correspondence between the sites of the 
simulated lattice gauge theory and the sites of 
the optical-lattice simulator. This correspon- 
dence characterized previous proposals and 
necessitated physically placing the gauge fields 
in-between matter sites (18-21). Instead, as il- 
lustrated in Fig. 1B, here one site of the physical 
lattice hosts two matter components, each taken 
from one adjacent site Disa and Bnv)s as well 
as the link (L,,). 

The enhanced physical overlap in this config- 
uration decisively improved time scales of the 
spin-changing collisions, which until now were 
a major limiting factor for experimental im- 
plementations. Moreover, a single well of the 
optical lattice already contains the essential 
processes between matter and gauge fields 
and thus represents an elementary building 
block of the lattice gauge theory. These build- 
ing blocks can be coupled by Raman-assisted 
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tunneling of the matter fields [see supplemen- 
tary materials (SM). 

The Hamiltonian H = wi: Hy + hQ(b}, pOnv+ 
bnvOnp)| of the extended system can thus 
be decomposed into the elementary building- 
block Hamiltonian H,, and the Raman-assisted 
tunneling (with Raman frequency ~Q). Here, 
An reads (writing bp = = Basix: b, = = a and 
L=L es) 


r _ ats 
fik= 5 (Bybp — by by) 
(6, £_by + 8,2, Bp) (1) 


where L, =L,+il, and L_=L,—-“iLy. 
The first term on the right-hand side of Eq. 1, 
which is proportional to the parameter y, de- 
scribes the energy of the gauge field, and the 
second term, ~A, sets the energy difference 
between the two matter components. The last 
term, ~A, describes the U(1) invariant coupling 
between matter and gauge fields, which is 
essential to retain the local U(1) gauge sym- 
metry of the Hamiltonian H (see SM for more 
details). 

We implemented the elementary building 
block Hamiltonian H yn With a mixture of 
2L ~ 300 x 10° sodium and N ~ 50 x 10? lithium 
atoms as sketched in Fig. 1C (see SM for de- 
tails). Both species were kept in an optical dipole 
trap such that the external trapping potential 
is spin insensitive for both species. An external 
magnetic bias field of B ~ 2G suppressed any 
spin change energetically, such that only the 
two Zeeman levels, m,- = 0 and 1, of the F = 1 
hyperfine ground state manifolds were popu- 
lated during the experiment. The Na states are 
labeled as |t) = |mr = 0) and||) = |mpr = 1), 
on which the spin operator L associated to 
the gauge field acts. The first term of Eq. 1 is 
then identified with the one-axis twisting 
Hamiltonian (27, 28). We label the “Li states 
as “particle” |p) = |mp = 0) and “vacuum” 
|v) = |mp = 1), in accordance with the matter 
field operators bp and by. With this identi- 
fication, the second term arises from energy 
shifts due to the external magnetic field and 
density interactions. Finally, the term ~A is 


0 20. 40 60 80.100 
time [ms] 


physically implemented by heteronuclear spin- 
changing interactions (29). 

The resulting setup is highly tunable, as we 
demonstrated experimentally on the building 
block. We achieved tunability of the gauge field 
through a two-pulse Rabi coupling of the Na 
atoms between ||) and |t) using an inter- 
mediate |F = 2) state, which yields a desired 
value of L,/L = (N; — N,)/(N; + .N,) Fig. 2). 
At the same time, un kept the “Li atoms in 
|v), corresponding to the initial vacuum of 
the matter sector at A>», with <1% detected 
in |p) (Fig. 2). 

When the gauge-invariant coupling was 
turned off by removing the Na atoms from 
the trap, we observed no dynamics in the 
matter sector beyond the detection noise. By 
contrast, once the gauge field was present, 
the matter sector clearly underwent a trans- 
fer from |v) to|p) for proper initial conditions, 
as illustrated in Fig. 3A for an initialization to 
L,/L = —0.188 at a magnetic field of By = 
2.118(2)G (30). This observation demonstrated 
the controlled operation of heteronuclear spin- 
changing collisions implementing the gauge- 
invariant dynamics in the experiment. 

To quantify our observations, we extracted 
the ratio N,/N, with N = Ny + Ny, as a func- 
tion of time as presented in Fig. 3B. We ob- 
served nonzero N,/N, describing “particle 
production,” on a time scale of a few tens of 
milliseconds, with up to 6% of the total V being 
transferred to |p). This value is consistent with 
our expectations from conservation of the ini- 
tial energy Eo /h = xL2 — AN, /2, from which 
we estimated a maximum amplitude on the 
order of a few percent. Owing to the much larger 
*°Na condensate, the expected corresponding 
change in L, /L is ~2%, which is currently not 
detectable with our imaging routine (see SM 
for details). Coherent oscillations in V,, /N were 
seen to persist for about 100ms. 

We display N,/N over the entire range of 
initial L, in Fig. 4, keeping a fixed time of 30ms. 
The upper panel (A) corresponds to the same 
experimental setting as in Fig. 3. A clear reso- 
nance for particle production can be seen 
around L,/L~ —0.5, approximately captured 
by the resonance condition 2yL, ~ A (see SM 


Fig. 3. Dynamics of particle production. (A) The 
number density distribution in state |p) as a function 
of time for L,/L = —0.188. (B) The corresponding 
particle number N,/N. The blue circles give the 
experimental values with bars indicating the statisti- 
cal error on the mean. The red curve is the 
theoretical mean-field prediction of Hamiltonian 

(Eq. 1) with parameters determined from a fit of 
the data in Fig. 4A and phenomenological damping. 
The shaded area indicates the experimental noise 
floor. The dashed line marks the time of 30 ms as it 
is used for the experimental sequence generating the 
data in Fig. 4. 
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Fig. 4. Resonant particle production. (A to D) The 
number of produced particles as a function of 
initially prepared L,/L after 30 ms for different- 
bias magnetic fields. Blue circles are experimental 
values with bars indicating the statistical error on 
the mean. The red curve in (A) arises from the 
theoretical model using the best estimate values 
of X, A, Ag, and A,. The remaining curves in (B) to 
(D) are computed using the same parameters, 
including Ag. The shaded area indicates confidence 
intervals of the fit from bootstrap resampling. 

The dashed line in (A) indicates the L,/L value 
corresponding to the time evolution shown in Fig. 3. 


for details). The asymmetry of the resonance 
is aclear manifestation of the nonlinearity of 
the dynamics. As we reduced the magnetic 
field B, presented in Fig. 4, B to D, we observed 
a shift of the resonant particle production 
together with a reduction in amplitude, which 
continued to be qualitatively captured by the 
resonance condition 2yL,~ A. The maximal 
amplitude of the particle production was nec- 
essarily reduced by the conservation of total 
magnetization as the resonant peak was 
pushed closer to L,/L = —1. For fields that 
were smaller than Bin ~ 1.96G, the matter 
and gauge field dynamics became too off- 
resonant, and particle production was not 
observed any longer. 

We compare the experimental results to the 
mean-field predictions of Hamiltonian (Eq. 1) 
for chosen x, A, and A(L,, B) = Ap + AL, /Z+ 
Ap(B— B,4)/Ba (see SM for the origin of the 
dependence on the magnetic field B and the 
initial spin Z,,). A first-principle calculation of 
these model parameters, using only the ex- 
perimental input of our setup, yielded xe /2n = 
14.92mHz, 1" /2n = 42.3 Hz, Ab /2n = — '77Hz, 


Mil et al., Science 367, 1128-1130 (2020) 


6 March 2020 


AY /2n = 4.474kHz, and A® /2n ~ — 1.669kHz. 
These values were obtained by neglecting any 
residual spatial dynamics (37) of the atomic 
clouds within the trapping potential, which 
renormalized the model parameters. Moreover, 
the mean-field approximation was not able to 
capture the decoherence observed in Fig. 3 at later 
times. However, the features of the resonance 
data in Fig. 4 were more robust against the de- 
coherence as it probed the initial rise of particle 
production. 

We included the decoherence into the model 
phenomenologically by implementing a damping 
term characterized by y/2n = 3.54(94) Hz, 
which was determined by an exponential en- 
velope fit to the data of Fig. 3B. Physically, the 
damping had several origins: quantum fluctu- 
ations; fluctuations of initial state preparation, 
as well as values of parameters; atom loss; and 
the spatial dynamics of the two species within 
the building block. In particular, the first two 
did not compromise gauge invariance. The last 
two sources of dissipation can in general be 
controlled by reducing the particle density 
and by implementing a deep optical lattice that 
freezes out the spatial dynamics within in- 
dividual wells. Fixing y, the best agreement 
(solid red line) with the data in Fig. 4A was 
obtained for y/2n = 8.802(8) mHz, 4/2n = 
16.46) WHz, Ao /2n = —4.8(16)Hz, andA; /2n = 
2.681(1)kHz. The prediction with these model 
parameters showed excellent agreement with 
the data in Fig. 3B (red line) for all times ob- 
served. Notably, our established model also 
described the data in Fig. 4, B to D, by in- 
cluding Ap /2n = —519.3(3)Hz (see SM). Com- 
pared to the ab initio estimates, all fitted values 
had the expected sign and lie in the same order 
of magnitude. 

Our results demonstrated the controlled 
operation of an elementary building block of 
a U(1) gauge theory and thus open the door 
for large-scale implementations of lattice gauge 
theories in atomic mixtures. The potential 
for scalability is an important ingredient for 
realistic applications to gauge field theory 
problems. Digital quantum simulations of 
gauge theories on universal quantum com- 
puters (5, 9) are challenging to scale up. This 
difficulty makes analog quantum simulators, 
as treated here, highly attractive, because they 
can be scaled up and still maintain excellent 
quantum coherence (6-8, 32-34). Proceeding 
to the extended system requires optical lattices 
and Raman-assisted tunneling [see SM and 
(22)]. The resulting extended gauge theory will 
enable the observation of relevant phenome- 
na, such as plasma oscillations or resonant 
particle production in strong-field quantum 
electrodynamics (35). Along the path to the 
relativistic gauge theories realized in nature, 
we will replace bosonic ’Li with fermionic Li, 
which will allow for the recovery of Lorentz 
invariance in the continuum limit (see SM). 
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Replay of cortical spiking sequences during human 


memory retrieval 


Alex P. Vaz'?3, John H. Wittig Jr’, Sara K. Inati*, Kareem A. Zaghloul'* 


Episodic memory retrieval is thought to rely on the replay of past experiences, yet it remains unknown how 
human single-unit activity is temporally organized during episodic memory encoding and retrieval. We found 
that ripple oscillations in the human cortex reflect underlying bursts of single-unit spiking activity that are 
organized into memory-specific sequences. Spiking sequences occurred repeatedly during memory formation 
and were replayed during successful memory retrieval, and this replay was associated with ripples in the medial 
temporal lobe. Together, these data demonstrate that human episodic memory is encoded by specific 
sequences of neural activity and that memory recall involves reinstating this temporal order of activity. 


etrieving individual episodic memories 

in the human brain relies on our ability 

to internally replay neural patterns of 
activity that were present when the 
memory was first experienced (1-6). This 
suggests a link with a parallel line of work in 
rodents that demonstrated that individual 
neurons in the medial temporal lobe (MTL) 
fire in sequences when animals are navigat- 
ing spatial environments, and that these se- 
quences are replayed during awake periods of 
rest and during sleep (7-14). Replay of se- 
quences of spiking activity has been interpreted 
to reflect memory retrieval and consolidation 
(73) and even memory-related planning (74), 
but no direct evidence exists that demonstrates 
that replay of neural spiking sequences could 
also underlie episodic memory retrieval in hu- 
mans. Neuronal sequences replayed in the ro- 
dent MTL are associated with fast oscillations 
termed “ripples” (15-20). Ripples are also rel- 
evant for episodic memory retrieval in hu- 
mans (21-26), raising the possibility that ripples 
may also be associated with memory-relevant 
replay of spiking activity in the human brain. 
We therefore investigated the relationship 
between cortical ripples and single-unit spiking 
activity in six participants (two female; 34.8 + 
4.7 years of age; mean + SEM). We implanted a 
microelectrode array (MEA) to collect single- 
unit and micro-local field potential (micro- 
LFP) data from the anterior temporal lobe in 
each participant (27, 28) while also collecting 
macro-scale intracranial electroencephalogra- 
phy (iEEG) signals from subdural electrodes 
placed over the lateral temporal cortex and 
along the MTL (Fig. 1, A and B). Recordings 
captured from the cortical iEEG contacts 
placed immediately above the MEA in the 
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middle temporal gyrus (MTG) enabled us to 
simultaneously examine neural activity from 
the same brain region across spatial scales 
(Fig. 1A). We used the iEEG signals to detect 
ripple oscillations in the MTG and the MTL and 
any potential coupling between brain regions (25). 

Ripples present in the iEEG recordings in 
the MTG were accompanied by ripples in the 
underlying micro-LFP signals and a burst of 
single unit spiking activity (Fig. 1C). Ripples 
exhibited band-limited power increases with- 
in 80 to 120 Hz at both macro-iEEG and 
micro-LFP spatial scales (fig. S1). Each ripple 
identified in each microelectrode was accom- 
panied by an increase in single-unit spiking 
activity on that channel (Fig. 1C and figs. S2 
and S3). Cortical spiking was tightly locked to 
the onset of the detected ripple oscillations at 
both the macro-iEEG and micro-LFP scales 
across participants (Fig. 1D and figs. S4 and S5). 
Within each micro-LFP ripple, spikes captured 
from the associated microelectrode channel in 
this cortical region were locked to the ripple 
trough, which is consistent with the relationship 
between spiking and ripple activity observed in 
rodents and humans (Fig. 1, E and F) (/8, 29). 

Each participant performed a paired-associates 
verbal memory task (25, 27) that required 
them to encode and subsequently retrieve new 
associations between pairs of randomly selected 
words on each trial (Fig. 2A and supplementary 
materials). Although we found a strong relation 
between cortical spiking and detected ripple 
oscillations, we focused our analyses on the 
bursts of single-unit spiking activity and their 
temporal structure during memory formation 
(9-12). We defined a burst event as the time 
indices during which the cortical spiking ex- 
ceeded a population rate-based threshold for 
at least 25 ms (supplementary materials). Across 
all participants, burst events had a mean fre- 
quency of 1.4 + 0.2 Hz, and each burst involved 
39.9 + 6.3% of all identified units within that 
session. Burst events occurred repeatedly 
throughout the duration of the word pair pre- 
sentation (Fig. 2B). We reordered the units in 
each trial according to a template sequence that 


we derived from the relative timing of spiking 
activity between pairs of units during each en- 
coding period. We used this template sequence 
to visualize, but not analyze, the temporal struc- 
ture of unit activity across multiple burst events 
during both encoding and retrieval periods from 
the same trial (supplementary materials). Units 
within individual burst events appeared to 
preserve the same sequential order of firing 
throughout the duration of encoding (Fig. 2C). 

Because we observed repeated sequences of 
neuronal firing while participants were encod- 
ing the word pairs, we quantified the extent to 
which the sequences of neuronal firing within 
the burst events were consistent within each 
individual trial and different between trials. 
For each burst event, we determined the se- 
quence of spiking activity across units within 
that particular burst event by ordering each 
neuron according to when its maximum firing 
rate occurred in a +75-ms window around the 
center index of the burst event (fig. S6). We 
found several examples of units that formed a 
sequence in one trial during word pair pre- 
sentation and rearranged to form a different 
sequence in another trial (Fig. 2D). To exam- 
ine how similar any sequence was to any other 
sequence, we computed the matching index 
(MI) (72). The MI compares the pairwise tem- 
poral relationships between all units that are 
common to both sequences and takes on a 
value of 1 for perfect forward replay and -1 for 
perfect reverse replay (supplementary mate- 
rials). We computed the average pairwise MI 
between all sequences within each trial and 
compared this with the distribution of MI val- 
ues that arises when comparing all pairwise 
combinations of sequences across different 
trials. Across participants, in correct, but not 
incorrect, encoding trials, sequences were sig- 
nificantly more similar to other sequences with- 
in the same trial than to sequences in other 
trials [n = 6 participants, paired ¢ test; correct 
within-trial versus across-trial, t(5) = 3.26, P = 
0.023; incorrect within-trial versus across-trial, 
t(5) = 1.47, P = 0.202; correct versus incorrect, 
t(5) = 3.68, P = 0.014] (Fig. 2E). This differ- 
ence between correct and incorrect trials was 
not observed when calculating the similarity 
of unit identity (in which “identity” is defined as 
a binary vector on the basis of whether or not a 
unit fired within a burst event) (supplementary 
materials and fig. S7). This suggests that a nec- 
essary component of successful memory en- 
coding, and later retrieval, involves repeated 
sequences of cortical spiking activity that are 
specific to each trial. 

If successful memory encoding relies on the 
temporal order of neuronal firing, then we 
hypothesized that successful memory retrieval 
would involve replay of the same trial-specific 
sequence. In one trial, the sequence of cortical 
spiking observed in a single burst event during 
encoding was replayed in a burst event during 
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memory retrieval (Fig. 3A and fig. S8). Across 
the entire trial, we observed repeated burst 
events during both the encoding and retrieval 
portions of the paired associates memory task 
(Fig. 3B). We calculated the average sequence 
similarity (MJ) of each retrieval burst event to 
all encoding events. Over the course of mem- 
ory retrieval, sequences appeared to become 
more similar to the encoding sequences until 
the moment when the participant vocalized 
their response (Fig. 3C and fig. S9). Across 
participants, we found that this pattern of in- 
creasing sequence similarity was recapitulated 
in correct, but not incorrect, trials (Fig. 3D and 
fig. S10). Before vocalization, retrieval sequences 
were significantly more similar to encoding se- 
quences during correct compared with incorrect 
trials (P < 0.001, permutation test) (Fig. 3D). 
We replicated these results within individ- 
ual participants (fig. S11) and performed an 
N-way (factorial) analysis of variance (ANOVA) 
with participants and correctness of response 
(binary between correct and incorrect) as the 
independent variables for each trial and the 
corresponding sequence replay value as the ob- 
servation. We observed significant effects for 
correctness of response [Fg = 4.55, P < 0.001] 
but not for participants [F; = 1.14, P = 0.340]. 
We also confirmed this difference between 
correct and incorrect trials by using an alter- 
native metric of sequence similarity and by 
using nonparametric statistical tests (fig. S12). 
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(*P < 0.05, **P < 0.01). 


Sequence similarity was not correlated with 
the similarity in unit identity between burst 
events [correlation coefficient (7) = 0.07 + 
0.03; n = 6 participants, (5) = 1.80, P = 0.132] 
or with the number of units that were common 
to both sequences [7 = 0.06 + 0.03; n = 6 par- 
ticipants, (5) = 2.04, P = 0.096] (fig. S13), and 
the difference in sequence similarity between 
correct and incorrect trials did not arise owing 
to a difference in burst rates or ripple rates be- 
tween conditions (figs. S14 and S15). We found 
significant differences in sequence similarity 
between correct and incorrect trials in both 
the presence of detected cortical macro-iEEG 
ripples and even when ripples were not ex- 
plicitly detected when using our criteria (fig. S16). 
We did not find that sequences were replayed 
during the rest period between retrieval trials 
or during the math distractor period (fig. S17). 
We also did not find any evidence of signifi- 
cant reverse replay during correct retrieval 
and found that the duration of the sequences 
were not significantly different between cor- 
rect encoding and retrieval [108.0 + 10.4 ms 
versus 107.8 + 10.8 ms; 7 = 6 participants, t(5) = 
0.180, P = 0.865] (fig. $6). 

Our data, which demonstrates that trial- 
specific sequences observed during encoding 
are replayed during retrieval, suggest that se- 
quence replay should also be specific to each 
retrieval trial. We compared each sequence 
during the last second of the retrieval period 


indicates a single participant, and bars 


indicate SEM across all participants 


with sequences from other encoding trials by 
using a shuffling procedure. We first shuffled 
all trials by calculating the replay of each cor- 
rect retrieval trial for all nonmatching encod- 
ing trials. The true encoding-retrieval replay 
value was significantly greater than the values 
computed by using the shuffled pairs across all 
participants [n = 6 participants, paired ¢ test, 
t(5) = 3.49, P = 0.018] (Fig. 3E). We next 
shuffled the encoding trial labels using only 
the correct trials and then by swapping only 
the encoding trial labels from adjacent correct 
trials. The true unshuffled average sequence 
similarity between retrieval and encoding was 
significantly greater than the average in each 
shuffled condition (one-way ANOVA across 
all categories: F, = 4.70, P = 0.026; post-hoc 
paired ¢ test P < 0.05 for each category pair) 
(Fig. 3F), demonstrating that the replay of cor- 
tical spiking sequences is specific for each 
retrieved memory. This memory specificity ex- 
tended even to individual encoding-retrieval 
sequence pairs (fig. S18). Moreover, both cor- 
rect encoding and retrieval had a lower popu- 
lation spike rate and lower Fano factor compared 
with those of incorrect trials (Supplementary 
materials), suggesting that successful retrieval 
involves replaying precise sequences of sparse 
neural firing (Fig. 3G and fig. S19). 

Burst events observed during retrieval were 
closely associated with ripple oscillations cap- 
tured at both the macro-iEEG and micro-LFP 
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Fig. 4. MTL ripples precede cortical sequence replay. (A) Schematic of proposed mechanism for se- 
quence replay in the cortex. Units are not sequentially organized when cortical bursts occur in isolation 

(left), whereas sequence replay occurs when MTL ripples occur in coordination with cortical bursts (right). 

(B) Example of dynamic coupling between MTL and cortical ripples (top), bursting events (middle), and sequence 


replay during successful memory retrieval (bottom). (C) Average cortical 


eplay values relative to onset of MTL 


ripples in correct retrieval trials (blue), and the same data when MTL ripple temporal indices were jittered 
randomly (gray). Error bars represent SEM across all participants (***P < 0.001, permutation test). (D) Sequence 
replay during retrieval after dividing all sequences into those that were uncoupled and those that were coupled 
to MTL ripples. Each line indicates a single participant, and bars indicate SEM across all participants. Coupled 


sequences exhibited higher replay values than those of 


scale (Fig. 1C), yet only some of these cortical 
events were also coupled to ripples in the MTL. 
Previous evidence has suggested that successful 
memory retrieval involves a ripple-mediated 
interaction between the cortex and the MTL 
(18, 25). Although each cortical ripple may re- 
flect an underlying burst of spiking activity, 
only cortical burst events that are coupled to 
MTL ripples may be preferentially relevant for 
memory retrieval (Fig. 4A). We therefore hy- 
pothesized that sequences of spiking activity 
that occurred in association with MTL ripples 
during retrieval were more similar to encod- 
ing sequences than bursts of spiking that were 
not coupled with the MTL. In correct retrieval 
trials, we found several examples in which burst 
events associated with MTL ripples exhibited 
higher average sequence similarity to the en- 
coding period than those burst events that oc- 
curred in the absence of a MTL ripple (Fig. 4B). 
We explicitly quantified replay in the cortex 
triggered to the onset of MTL ripples and found 
that maximal replay occurred ~100 ms after 
MTLripple onset (significant replay compared 
with jittered ~100 to 175 ms) (Fig. 4C). We there- 
fore used this temporal relationship to designate 
every cortical burst event as coupled or uncou- 
pled to a MTL ripple (supplementary materials) 
and examined the replay content of each type 
of cortical burst event. In correct retrieval trials 
across all participants, burst events coupled to 
MIL ripples demonstrated significantly greater 
replay of the sequences present during encoding 
compared with uncoupled events [7 = 6 par- 
ticipants, 4(5) = 2.85, P = 0.036] (Fig. 4D). 
Together, our data demonstrate that ripple 
oscillations reflect bursts of spiking activity 
in the human cortex and that these bursts 
contain item-specific sequences of single-unit 
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uncoupled sequences (*P < 0.05). 


spiking that are established during memory 
encoding and replayed during memory retrieval. 
Cortical spike replay is enhanced during retrieval 
when bursts of spiking in the cortex are coupled 
with ripples in the MTL. Our data therefore 
suggest that successful memory encoding and 
retrieval of individual items involves a specific 
temporal ordering of cortical spiking activity 
and provide a link between previous evidence 
regarding the role of ripple oscillations in hu- 
man memory (21-26) and evidence regarding 
the replay of neuronal sequences observed in 
rodents (7-14). Studies of spike replay in ro- 
dents have largely focused on sequences of 
spiking activity that emerge during spatial 
navigation and that are replayed during sleep 
or during awake periods of rest. These data 
have inspired models of memory that posit that 
memory formation involves an initial encoding 
state, in which the temporal order of spiking 
activity is established through sequential expe- 
rience, and a subsequent consolidation state, 
during which these sequences of spiking ac- 
tivity are replayed in a temporally compressed 
manner in the MTL (/7, 30). Our task requires 
participants to memorize abstract associations 
between word pairs and to retrieve those as- 
sociations following only a brief distractor pe- 
riod. Hence, our data provide direct evidence 
that awake human memory retrieval involves 
replay of sequences of spiking activity in the 
cortex. Whether such cortical spike replay 
plays a similar role in longer-term memory 
consolidation in the human brain remains 
an open question. Moreover, whereas tem- 
poral compression and reverse replay may be 
common features of spike replay observed in 
rodents, their absence here may be related to 
our task involving neither sequential experi- 
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Efficient tandem solar cells with solution-processed 
perovskite on textured crystalline silicon 


Yi Hou**, Erkan Aydin*, Michele De Bastiani**, Chuanxiao Xiao**, Furkan H. Isikgor?, Ding-Jiang Xue’, 
Bin Chen!, Hao Chen’, Behzad Bahrami*, Ashraful H. Chowdhury“, Andrew Johnston’, 

Se-Woong Baek’, Ziru Huang’, Mingyang Wei‘, Yitong Dong’, Joel Troughton, Rawan Jalmood?, 
Alessandro J. Mirabelli”, Thomas G. Allen”, Emmanuel Van Kerschaver?, Makhsud I. Saidaminov’, 
Derya Baran’, Qiquan Qiao*, Kai Zhu°, Stefaan De Wolf*{, Edward H. Sargent’+ 


Stacking solar cells with decreasing band gaps to form tandems presents the possibility of overcoming the 
single-junction Shockley-Queisser limit in photovoltaics. The rapid development of solution-processed 
perovskites has brought perovskite single-junction efficiencies >20%. However, this process has yet to enable 
monolithic integration with industry-relevant textured crystalline silicon solar cells. We report tandems that 
combine solution-processed micrometer-thick perovskite top cells with fully textured silicon heterojunction 
bottom cells. To overcome the charge-collection challenges in micrometer-thick perovskites, we enhanced 
threefold the depletion width at the bases of silicon pyramids. Moreover, by anchoring a self-limiting passivant 
(1-butanethiol) on the perovskite surfaces, we enhanced the diffusion length and further suppressed phase 
segregation. These combined enhancements enabled an independently certified power conversion efficiency of 
25.7% for perovskite-silicon tandem solar cells. These devices exhibited negligible performance loss after a 
400-hour thermal stability test at 85°C and also after 400 hours under maximum power point tracking at 40°C. 


hrough an intensive worldwide effort, 
perovskite solar cell (PSC) power conver- 

sion efficiencies (PCEs) have increased 
from an initial 3.8% to a certified 25.2% 
during the past decade (J, 2). This pro- 
gress is based on the combination of materials 
properties such as a low energy required for 
crystal formation (3), a sharp optical absorp- 
tion edge (4), and a tunable band gap (5) ideal- 
ly suited for photovoltaic (PV) applications. 
The rapid development of PSCs is further en- 
abled through the use of solution-based coat- 
ing methods to deposit the semiconductors (6). 
These properties also make PSCs attractive 
as top cells for tandem applications that use 
lower band gap bottom cells such as crystal- 
line silicon (c-Si) and copper indium gallium 
selenide (CIGS) (7-18). By reducing thermal- 
ization losses, stacking PV absorbers of de- 
creasing band gap in a multijunction device 
can overcome the Shockley-Queisser efficiency 
limit of 33.7% for single-junction solar cells. The 
combination with a c-Si bottom cell is of par- 
ticular appeal, because single-junction c-Si- 
based technology has come to dominate the PV 
market. However, as c-Si solar cell efficiencies 
approach their practical limits, multijunction 
technologies using a c-Si bottom cell are of in- 
terest to drive further efficiency improvements. 
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Silicon heterojunction technology is attract- 
ive for tandem solar cells because of its high 
PCE and comparatively straightforward tan- 
dem integration. At present, most reported 
monolithic perovskite-silicon tandem devices 
are based on a single-side texturing configu- 
ration: c-Si wafers with their front flat-polished 
so that it is compatible with existing solution- 
based perovskite fabrication processes and a 
textured back side for enhanced light trapping 
relative to that of a double-side polished c-Si 
device (8, 14-16, 18). This configuration, how- 
ever, provides limited light-trapping benefits 
and requires additional antireflection foils that 
do not provide sufficient light trapping com- 
pared with the textured counterpart (78). Fur- 
thermore, the effectiveness of antireflection 
foils can be compromised upon encapsulation. 
The preparation of such atomically smooth 
surfaces is also not practiced industrially owing 
to the high processing costs involved. Build- 
ing up efficient tandems using double-side 
textured wafers—the industry-compatible c-Si 
approach—still challenges the PV community. 

As a result, recent attention has shifted to 
combining perovskites with fully textured c-Si 
(7). The benefits of fully textured tandems have 
been demonstrated previously by Sahli et al. 
using a hybrid two-step deposition method 
combining sequential coevaporation of PbI, 
and CsBr and solution conversion (7). Unfor- 
tunately, the fill factor (FF) was moderate, a 
result of the limited perovskite quality achieved 
using this approach (19, 20). More efficient, 
but also more complex, thermal coevapora- 
tion enables textured tandems; however, the 
large vapor-pressure difference between the 
organic and inorganic components demands 
a high level of control over the deposition rates 
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of each precursor during the evaporation 
process (27). 

Early efforts to deposit perovskites by using 
solution techniques on top of micrometer-sized 
Si pyramids quickly revealed a number of 
hurdles: uncovered Si pyramids, shunt paths, 
and inefficient charge collection in films with 
variable thickness. In addition, the appli- 
cation of conventional surface-passivation 
techniques—a prerequisite for state-of-the- 
art PSC device performance—are incompatible 
with the rough perovskite surfaces that result 
from the underlying c-Si texture (7). So far, 
insulating solution-processed passivants have 
failed to cover rough surfaces with the needed 
consistency of a thickness of a few nanometers. 

Previous reports of perovskite-silicon tandems 
atop textured c-Si bottom cells have relied on 
physical vapor deposition of the perovskite front 
cell, rather than solution-processed perovskite 
cells on textured c-Si bottom cells. We sought to 
develop a high-quality micrometer-thick perov- 
skite to cover the pyramids and, simultaneously, 
to enhance charge collection in these thick films 
through improved drift and diffusion of photo- 
generated carriers. We combined solution- 
processed, micrometer-thick, wide-band gap 
perovskite solar cells with pyramidal-textured 
c-Si bottom cells. This approach achieved a 
threefold enhanced depletion width in the 
perovskite semiconductor at the valleys of Si 
pyramids, improving carrier collection, as re- 
vealed using nanometer-scale kelvin probe 
force microscopy (KPFM). 

To further increase carrier diffusion length, 
we introduced a conformal surface-passivation 
strategy for rough surfaces by anchoring a self- 
limiting passivant on the wide-band gap perov- 
skite surface. This passivant also suppresses 
phase segregation. In addition, micrometer- 
thick perovskites allowed us to maintain a 
plateau of external quantum efficiency (EQE) 
of 92 to 93% across the spectral range of 650 to 
730 nm. The fully textured bottom cells mini- 
mized reflection losses and efficient light trap- 
ping was achieved for the bottom cells, crucial 
to satisfying current-matching conditions. Over- 
all, with the combined enhancements in charge 
drift and diffusion, the best tandem cells herein 
achieved an independently certified efficiency 
of 25.7%, combined with negligible perform- 
ance loss after 400-hour thermal-stability tests 
at 85°C and also after 400 hours under max- 
imum power point (MPP) tracking at 40°C. 

As seen in the scanning electron microscope 
(SEM) top-view images, the textured c-Si fea- 
tures 2-um-sized (111) faceted pyramids that 
were fabricated by alkaline wet-chemical etch- 
ing (Fig. 1A). When forming tandems atop a 
textured c-Si bottom cell, it is important that 
one control the perovskite morphology and 
film thickness. When the perovskite film was 
deposited under fabrication conditions used 
for conventional planar perovskites (generally 
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4NiOx/InOx 


Si 500 nm 


Fig. 1. Microstructure of solution-processed perovskite top cells on 
textured c-Si bottom cells. (A and B) SEM top-view images of the textured 
c-Si surface with an average pyramid size of 2 um (A) and corresponding 
substrates covered by solution-processed perovskite crystal (B). 

(C and D) SEM cross-section images of a textured c-Si with an average 


resulting in 400- to 600-nm-thick films), pyr- 
amids that were not capped with perovskite 
were observed, and these regions created shunt 
paths in devices (fig. S1). To fully cover the 
micrometer-sized pyramids, we used a concen- 
trated (1.65 to 1.75 M) precursor that resulted 
in a micrometer-thick perovskite with large (2 
to 4 um) grain sizes (Fig. 1B). This process en- 
abled us to achieve uniform perovskite cover- 
age of the pyramids and eliminated the need for 
additional flattening processes (Fig. 1, C and D). 
To achieve current matching, we opted to 
broaden the band gap of the micrometer-thick 
perovskite rather than reduce the thickness 
of a smaller-band gap perovskite because the 
latter approach would uncover pyramids. This 
thick, wide-band gap perovskite (1.68 eV, 
CS0,.05MAo15FAp.gPbIs.25Bro,75) top cell also pro- 
vided a path toward a higher ultimate effi- 
ciency limit in tandem solar cells (fig. $2). 
Because the size of the Si pyramids and 
perovskite thickness were similar, this infil- 
trated morphology differs from previously re- 
ported flat and conformal architectures used 
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in perovskite-silicon tandem devices (fig. S3) 
(7, 8, 14, 15). These images also suggest that the 
textured structures substantially modified the 
surface geometry by increasing the contact area. 
Rather than producing a conformal coating with 
uniform thickness, which would be similar to the 
evaporation case, the solution-processed perov- 
skite smoothly overcoated the pyramid geome- 
try while retaining the curvature of the textured 
surface beneath (Fig. 1D). The elemental distri- 
bution within the tandems measured by means 
of energy-dispersive x-ray spectroscopy (EDS; 
Fig. 1E) confirmed the presence of a textured 
layer stack. From the detailed features of the 
perovskite and conformal NiO, interface, we 
did not see undesired accumulation of hole- 
transport layer (HTL) material at the bottom, 
nor did we observe HTL material absence at 
the facets of the Si pyramids (7, 8, 14, 15). 
Thick perovskite layers require sufficiently 
long charge-carrier diffusion lengths to enable 
efficient charge collection. This condition de- 
mands perovskite crystals and surfaces of high 
electronic quality. To increase the performance 
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pyramid size of 2 um (C) and corresponding substrates covered by solution- 
processed perovskite crystal (D). (E) SEM image with corresponding EDS 
maps indicating the distribution of lead (Pb, aqua), bromide (Br, green), 
silicon (Si, purple), and nickel (Ni, yellow) inside the solar cell. Lead and 
bromide distribution were confined to the perovskite layer. 


of single-junction wide-band gap PSCs, much 
research has focused on developing surface 
treatments (22). Trioctylphosphine oxide (TOPO) 
can substantially reduce nonradiative recom- 
bination and increase perovskite stability (23), 
but its insulating nature inhibits charge trans- 
port in devices. In addition, the rough surface 
of textured c-Si devices sets an additional chal- 
lenge in the search for surface passivation. A 
passivant film must conformally coat and pas- 
sivate the rough top perovskite surface with 
readily controlled thickness and functional 
groups without being overly sensitive to surface 
topography, treatment time, and perovskite thick- 
ness. Moreover, this treatment should not dissolve 
the perovskite nor alter its crystal structure (24). 
We pursued an approach in which we ex- 
posed the rough perovskite top surface to 
1-butanethiol vapor, a technique adapted in 
light of previous work performed on thiol-based 
self-assembled monolayer growth on metal 
surfaces. The thiol group anchored to the 
perovskite surface by strong coordination of 
the thiol on Pb**. The thiol molecules rapidly 
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Fig. 2. Effects of SLP on diffusion length and phase segregation in 


micrometer-thick wide-band gap perovskite. (A) TRPL spectra of 
1.68-eV-band gap thin films under different surface treatments. a.u., arbitrary 
units. (B) Normalized PL spectra of control perovskite films after illuminating 
for 0, 10, and 20 min. (C) Normalized PL spectra of SLP-treated perovskite 


films after illuminating for 0, 10, and 20 min. (D) Nan 
of the area indicated by the light blue square in (E) ( 


diffused through the vapor phase to the perov- 
skite surface, in contrast with thiol passivation 
of PbS colloidal quantum dots, where ligand 
exchange occurred in solution. Because of the 
self-limiting nature of thiol passivation, no 
anchoring sites were present after the initial 
passivation for further growth of additional 
layers. Thus, prolonged exposure did not dam- 
age the perovskite. 

We used ultrafast transient absorption spec- 
troscopy to investigate the phase distribution 
of 1.68-eV-band gap perovskite films after dif- 
ferent treatment times. The two classes of 
treated films each showed a single ground- 
state bleaching peak centered at ~725nm 
(fig. S4). This result indicated that the perov- 
skite films existed compositionally within a 
single phase and that the bulk properties of 
the perovskite film could be stabilized with 
the thiol self-limiting passivation (SLP) treat- 
ment. We confirmed identical recombination 
kinetics for the film after 30-min and 20-hour 
SLP treatments: The data (fig. S4) attest to the 
time-insensitivity of the passivation process. 
Similarly, there were no differences in surface- 
sensitive grazing-incidence wide-angle x-ray 
scattering (GIWAXS) measurements between 
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the samples with different SLP treatment 
times. After different treatment times, the 
perovskite peaks maintained their intensities 
and positions, suggesting no change in crystal 
structure and orientation (fig. S5). 

After SLP treatment, we observed in time- 
resolved photoluminescence (TRPL) an increase 
in the carrier lifetime from 570 to 900 ns when 
we measured the samples from air-perovskite 
sides. This is ~15% higher than that of similar 
films treated with TOPO (Fig. 2A) and ~60% 
higher than that of control samples. We pro- 
pose that passivation using smaller-sized thiols 
provided increased diffusion through the vapor 
phase and reduced steric hindrance. When we 
illuminated from the glass-perovskite side, the 
carrier lifetime was essentially similar in these 
three samples (fig. S6). Passivation was most 
effective on the top surface of thick perovskite, 
where, in the pristine case, major carrier re- 
combination would otherwise have occurred. 

The SLP treatment also enhanced phase 
stability of the perovskite. After 20 min of con- 
tinuous light exposure, the SLP-treated films 
exhibited a stable PL peak position, whereas 
the control sample showed a red shift in the 
peak position, indicative of phase segregation 
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the diffusion length of the control perovskite. The white dashed line shows 
the corresponding grain-boundary (GB) area. (E) Contact-mode atomic force 
microscopy (AFM) topography of the control sample (top) and SLP perovskite 
(bottom) (scale bar, 100 nm). (F) Nanoscale-resolved mapping of the area 
indicated by the dark blue square in (E) (100 nm by 100 nm) of the diffusion 
length of the SLP perovskite. The white dashed line shows the corresponding 


(Fig. 2, B and C). This result is in agreement 
with earlier conclusions from Belisle et al. 
demonstrating that charge accumulation and 
carrier trapping at perovskite surfaces are drivers 
of photoinduced halide segregation and that 
efficiently passivating the perovskite surfaces 
suppresses phase segregation and stabilizes 
wide-band gap perovskites (25). 

A representative, randomly selected 100-nm- 
by-100-nm area for topographic imaging that 
included grain boundaries (red square in fig. 
S7) was mapped for the nanoscale charge- 
carrier diffusion length, which is related to 
the local charge-carrier transport time (fig. 
S8) and charge-carrier recombination lifetime 
(fig. S9). Figure 2, D to F, presents nanoscale 
diffusion-length maps of both control sample 
and SLP-treated perovskite at similar regions, 
indicated by the blue squares. The diffusion 
length was improved in both grain and grain 
boundaries for SLP perovskites compared with 
control samples. The diffusion length in SLP 
perovskites within grains was ~570 nm. At 
the grain boundaries, the diffusion length 
of the SLP perovskite decreased by ~50 nm, 
whereas the diffusion length in the control 
sample decreased by ~100 nm (fig. S10). Thus, 
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Fig. 3. Enhanced charge extraction in textured structured tandem cells. 

(A to C) AFM image of a cross section of the peak of a pyramid in the textured and 
flat tandem with a schematic showing each layer across the top of the Si bottom 
cell and the corresponding measured electric-potential zone. (D to F) Potential- 


passivation of the grain boundaries for the 
SLP perovskite increased the charge diffusion 
length compared with the control sample. 
To probe the effects of the textured struc- 
ture on the electrical field in the vertical di- 
rection, we used nanometer-scale KPFM to 
profile and observe the differences in electric- 
potential (or electric-field) distribution across 
the flat and textured devices. To determine the 
bulk profile from the surface measurement, 
we applied different forward-bias voltages of 
land 1.5 V to the device and imaged the cross- 
sectional surface potential under each voltage 
(figs. S11 to S20). The junction characteristic 
was assessed from the small current flow, or 
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equivalent shunt resistance, under bias, which 
was acquired by measuring the voltage drop 
across the cross-sectional surface. 

By taking the first derivative of the potential 
difference, we determined the electric-field 
distribution relative to the metallurgical in- 
terfaces (Fig. 3, A to I). We observed two pro- 
minent junction peaks at the HTL-perovskite 
and perovskite-electron-transport layer (ETL) 
interfaces, whereas the electric field was 
near zero inside the perovskite layer. The 
perovskite-ETL junction has a similar strength 
and depletion width for both flat and text- 
ured cells, which agreed with the perovskite- 
ETL interfaces being identical for these two 


6 March 2020 


Potential difference (mV) 
& 
8 


Electric field difference (a.u.) 


Distance (um) 


c-Si cell 


difference profiling across the device under different bias voltages subtracted from the 
0 V curve. (G to I) Electric-field difference across the device, taken by the first 
derivative in (D) to (F). (J and K) Schematics of electric-field distribution in textured 
(J) and flat tandem devices (K). h*, holes; e”, electrons. 


devices. Interestingly, the electric field at the 
HTL-perovskite interface was enhanced when 
a textured substrate was used (Fig. 3, J and K). 

We determined that the depletion region in 
the perovskite (at the HTL-perovskite inter- 
face) was much wider at the valley of Si pyr- 
amids (~380 nm) than at their top (~120 nm) 
or compared with flat perovskite cells. This 
result implies that where the perovskite was 
the thickest, the depletion width was the 
largest, which would be desirable for effective 
charge collection. At the pyramid valleys, the 
perovskite was more confined and was sub- 
jected to electrical fields superimposed from 
neighboring HTL-coated pyramids. When Si 
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Fig. 4. Device characterization and stability of tandems. (A) Schematic of 
solution-processed perovskite-textured silicon tandem architecture. a-Si:H(n), n-doped 
hydrogenated amorphous silicon; a-Si:H(i), intrinsic hydrogenated amorphous silicon; 
a-Si:H(p), p-doped hydrogenated amorphous silicon. (B) Calculated Jsc values of 

the perovskite cells as a function of perovskite layer thickness. The red dot represents 
the EQE-integrated Jsc value in the textured tandem top cell. (C) Measured weighted 
reflectance as a function of pyramid sizes of c-Si. Texturing size refers to the pyramid 
base. The SEM images of different texturing sizes are reported in the insets (scale 

bar, 2 um). (D) J-V characteristics of flat, textured, and SLP-treated textured tandems. 
(E) J-V characteristics of certified SLP-treated textured tandems. (F) MPP tracking of 


pyramid valleys were sharper, depletion re- 
gions were wider (in a linear relation; detailed 
values are shown in tables S1 and S2), which 
further suggested that the depletion width 
correlates with the geometry factor. In con- 
trast with the case of other deposition meth- 
ods (such as coevaporation), this geometry was 
only observed when the solution-processed 
perovskite cells smoothed out the textured 
silicon cell and caused this beneficial geometry- 
dependent electric-field distribution. This find- 
ing agrees with the simulated results (fig. S21). 

The KPFM observation accounted for device 
performance trends, in that charge collection 
was enhanced in the best pyramid devices. 
The stronger and wider depletion at the valley 
of the pyramids benefited charge collection 
of charge carriers photogenerated with long- 
wavelength incident light. Outside of the de- 
pletion region, the carriers needed to diffuse 
through the perovskite absorber layer to be 
collected. At the top of the Si pyramids, the 
depletion width was not broadened. However, 


here the distance between hole contact and 
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electron contact was much shorter (100 to 
300 nm), which was well within the typical 
range of efficient charge diffusion in a PSC 
(Fig. 3, J and K). In light of the surrounding 
pyramidal structure, charges generated in the 
textured perovskite had a shortened transport 
distance to both contacts compared with a flat 
perovskite cell. 

Figure 4A sketches the cross section of the 
textured tandem. To explore the role of the 
thick perovskite, we studied the thickness- 
dependent current generation theoretically. 
As presented in Fig. 4B, the short-circuit cur- 
rent density (Jsc) of the perovskite cell in- 
creased with layer thickness. The red dot 
represents the experimental Jgc value of the 
~1.1-um-thick perovskite used in this work 
(fig. S22), which is within <10% of the theo- 
retical limit. This certified ~19.3 mA/cm? value 
is among the highest values for a single-pass 
device with 1.68-eV-band gap perovskites (Fig. 
4H). In contrast with opaque devices with re- 
flection from the rear electrode, single-junction 
devices required a thicker absorber to capture 
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certified SLP-treated textured tandems and PCE distributions of 88 individual tandem 
devices. (G and H) EQE of the flat (G) and textured (H) devices (integrated current 

of 18.2 mA/cm? for the top cell and 16.8 mA/cm? for the bottom cell in the flat device: 
19.3 mA/cm* for the top cell and 19.2 mA/cm’ for the bottom cell in the Fraunhofer 
ISE CalLab PV Cells—certified textured device). (I) J-V curves of tandem devices before 
encapsulation (glass and butyl rubber) at the beginning and the end of the 85°C stability 
test. (J) J-V parameters measured over a 400-hour stability test at 85°C (relative 
humidity ~45 to 50%). (K) J-V curves of the tandem devices before encapsulation 
(glass and POE) at the beginning and the end of the MPP stability tests. (L) J-V 
parameters measured over >400 hours of light-soaking under MPP load at 40°C. 


more photons near the band edge. With more 
photons absorbed by the top cell, we expect an 
enhanced upper limit of tandem efficiency. 

Spectrally weighted reflectances were calcu- 
lated from the air mass (AM) 1.5 G spectrum in 
the range of 350 to 900 nm (Fig. 4C). To in- 
vestigate the effect of pyramid size on the 
reflection loss, we also selected different Si 
pyramid sizes by tailoring the texturing pro- 
cess. Although the pyramids in commercial 
Si are typically in the range of 2 to 7 um, the 
benefit to reflection began to saturate when 
the pyramid size reached 2 um or less (see fig. 
$23). This result is encouraging for perovskite- 
silicon tandems because it suggests that well- 
established solution-processing techniques can 
be united with textured silicon (fig. S24). 

We observed enhanced Jgc and FF in tan- 
dems (Fig. 4D and figs. S24 and $25) in the 
case of fully textured c-Si bottom cells. EQE 
measurements highlight the advantage of 
switching from a polished front side to a 
double-side textured architecture (Fig. 4, G 
and H). Reflections occurring in the flat design 
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induced more current loss in the 330- to 
1200-nm range. The introduction of a front 
texture reduced the overall reflectance, espe- 
cially at wavelengths of 550 and 800 nm. The 
Jsc integrated from the EQE spectra is in 
excellent agreement with that derived from 
the 1-sun current density-voltage (J-V) curve 
with enhanced Jgc in both the top cell and bot- 
tom cell on the double-side textured device. 

Both differential EQE and absorption con- 
firm the ~1.68-eV perovskite band gap (fig. S2). 
We observed narrower EQE overlap between 
the perovskite and Si cell. The textured-device 
EQE maintained a plateau at 92 to 93% in this 
region. This result is distinct from previous 
work based on 400- to 600-nm-thick perov- 
skite on flat devices in which the perovskite 
lost EQE at wavelengths of 550 and 800 nm. 
This high EQE photocurrent, combined with 
enhanced charge extraction, resulted in high 
FF, leading to an improved device perform- 
ance from 20.3 to 24.0% (table S3). In par- 
ticular, by further adding SLP treatment, we 
observed a distinct increase of FF from 72 to 
77% (Fig. 4D). By reducing the recombination 
area and improving the top contact design (fig. 
$26), we achieved a Fraunhofer ISE CalLab PV 
Cells-certified stabilized (at MPP) PCE of 25.7% 
(Fig. 4, E and F, and fig. $27). 

The tandems reported herein show low hys- 
teresis (Fig. 4E), high reproducibility (Fig. 4F 
and fig. S28), and excellent operational stabil- 
ity under accelerated tests (Fig. 4, I to L). We 
monitored the stability of devices encapsu- 
lated with glass and butyl rubber. We heated 
these devices to 85°C for 400 hours in the dark 
at ~40% relative humidity and found that the 
devices retained their original performance 
(Fig. 4, I and J). We also monitored 400-hour 
operating stability of devices encapsulated 
using glass and POE (polyolefin encapsulant): 
Here, we used 1-sun-equivalent illumination 
(fig. S29) at 40°C and ~40 to 50% relative 
humidity (Fig. 4, K and L). J-V curves in both 
scan directions [open-circuit voltage (Voc) to 
Jgc and Jgc to Voc] were measured at 10-min 


Hou et al., Science 367, 1135-1140 (2020) 


intervals at 100 mV/s from —0.1 to +1.9 V, and 
vice versa. Between measurements, the tan- 
dem solar cell was held at the MPP voltage as 
determined by the most recent J-V scan. De- 
vices retained their original performance after 
400 hours. In Fig. 4, I to K, the J-V curves of the 
tandem were reported before encapsulation, at 
the beginning of the test, and after 400 hours, 
and images are provided of devices at the 
end of the test. Overall, we attribute this good 
operating lifetime to the replacement of or- 
ganic carrier-selective layers with NiO, inor- 
ganic materials, which prevent iodine reaction 
with small-molecule organic HTLs (table S4). 
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Pervasive functional translation of noncanonical 
human open reading frames 


Jin Chen'?, Andreas-David Brunner’, J. Zachery Cogan”, James K. Nufiez", Alexander P. Fields'2*, 
Britt Adamson’+, Daniel N. Itzhak’, Jason Y. Li*, Matthias Mann*°, 


Manuel D. Leonetti*, Jonathan S. Weissman'+ 


Ribosome profiling has revealed pervasive but largely uncharacterized translation outside of canonical coding 
sequences (CDSs). In this work, we exploit a systematic CRISPR-based screening strategy to identify hundreds 
of noncanonical CDSs that are essential for cellular growth and whose disruption elicits specific, robust 
transcriptomic and phenotypic changes in human cells. Functional characterization of the encoded microproteins 
reveals distinct cellular localizations, specific protein binding partners, and hundreds of microproteins that are 
presented by the human leukocyte antigen system. We find multiple microproteins encoded in upstream 

open reading frames, which form stable complexes with the main, canonical protein encoded on the same 
messenger RNA, thereby revealing the use of functional bicistronic operons in mammals. Together, our results 
point to a family of functional human microproteins that play critical and diverse cellular roles. 


fforts to bioinformatically discover and 

annotate protein-coding open reading 

frames (ORFs) in genomes, termed coding 

sequences (CDSs), have traditionally relied 

on rules such as amino acid conservation 
and homology, translation initiation from an 
AUG start codon, and minimum length (i.e., 
100 amino acids) (7). These rules have been 
widely adopted on the basis of the assump- 
tion that short peptides are unlikely to fold 
into stable structures to perform functions. 
However, the generality of these rules has 
been challenged. For example, the ribosomal 
protein RPL41 is a 25-amino acid (aa) peptide 
and both sarcolipin (SLN, 31 aa) and phospho- 
lamban (PLN, 52 aa) bind to and regulate the 
sarcoplasmic Ca?* transporter SERCA (2, 3). 
Additionally, MYC can be translated from a 
noncanonical start codon CUG (4), which dem- 
onstrates that non-AUG initiation can produce 
functional proteins. Recent studies have added 
a handful of examples of short proteins, or 
microproteins (also called micropeptides or 
just peptides), performing diverse functions 
(5-18), some encoded on transcripts annotated 
as long noncoding RNAs (IncRNAs). Finally, 
upstream ORFs (uORFs), located in the 5’ un- 
translated regions of mRNAs, have long been 
implicated in cis-acting translational control 
of the main, canonical CDS (19-27), though it 
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has remained unclear whether they can gen- 
erate stable, functional peptides. 

Systematic identification of functional short 
CDSs remains challenging. Recent ribosome 
profiling (deep sequencing of ribosome- 
protected fragments) and mass spectrometry 
(MS) studies have identified thousands of 
previously unannotated CDSs (22-25) across 
bacteria, yeasts, viruses, and mammalian cells. 
However, for most cases, the cellular functions 
of these identified CDSs or their peptide pro- 
ducts remain unexplored. We reasoned that 
the advent of CRISPR and its ability to pre- 
cisely disrupt protein-coding regions (26), 
when combined with ribosome profiling, pro- 
vides an opportunity to define and empirically 
characterize the functional protein-coding ca- 
pacity of a given genome. In this work, we ap- 
plied various types of approaches—including 
ribosome profiling, MS, and multiple CRISPR- 
based techniques—to systematically discover 
noncanonical CDSs encoded in the human 
genome and validate their critical roles in 
diverse cellular pathways. 

To annotate potential CDSs comprehensively 
and accurately, we first investigated genome- 
wide translation by ribosome profiling across 
multiple cell types and conditions, including 
human induced pluripotent stem cells (iPSCs), 
iPSC-derived cardiomyocytes, human foreskin 
fibroblasts (HFFs), and HFFs infected with cy- 
tomegalovirus (27, 28) (fig. S1A). We leveraged 
the ORF-RATER algorithm to annotate ORFs 
(27), incorporating multiple lines of evidence 
to identify ORFs undergoing active transla- 
tion. This included consideration of the ac- 
cumulation of ribosome densities at the start 
and stop codons, three-nucleotide periodicity, 
and additional experimental results, such as 
data from harringtonine-treated cells in which 
ribosomes are stalled at initiation sites (27). In 
iPSCs and cardiomyocytes, in addition to 9490 
annotated CDSs (62% of the identified CDSs), 
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we identified 3455 distinct, noncanonical CDSs 
(22%, i.e., with no in-frame overlap with pre- 
viously annotated CDSs) and 2466 variant 
CDSs of annotated proteins (16%) in our high- 
statistical confidence set (Fig. 1A and materials 
and methods) (27). Among the distinct CDSs, 
818 were CDSs on transcripts lacking prior 
protein-coding annotations (“new’, ie., NcRNAs), 
2342 were upstream CDSs (i.e., WORFs or start 
overlaps: CDSs that overlap annotated start 
codons in a different reading frame), and only 
13 were downstream CDSs. Similar numbers 
of CDSs were present in HFFs (fig. S1B), with 
75% of the CDSs shared between the two cell 
types. Of the distinct CDSs, 96% are less than 
100 aa in length, and 36% of the CDSs use non- 
AUG start codons (Fig. 1, B and C; see also fig. 
$2 for further characterizations). 

Multiple lines of evidence suggest that the 
noncanonical CDSs are actively translated. 
The average ribosome density (metagene) of the 
IncRNA CDSs and of the translated uORFs 
closely mirrors footprints from that of annotated 
coding regions with strong three-nucleotide 
periodicities, a hallmark of active translation, 
as exemplified by traces from the IncRNA 
LINCO00998 transcript and a uORF of ARLSA 
(Fig. 1, D and E, and fig. S3). Our analysis also 
successfully recapitulated well-characterized 
short ORFs, such as the uORF on ATF4 (29) 
and the recently discovered IncRNA-encoded 
microproteins MOXI/mitoregulin (17, 12) and 
NoBody (0). Bona fide IncRNAs, such as 
XIST, HOTAIR, and NEAT], were not identi- 
fied to be protein coding (fig. S3E). Moreover, 
many of the CDSs were differentially translated 
during iPSC differentiation or viral infection 
(fig. S3F), providing evidence for translational 
control in different cell states. 

MS-based proteomics in iPSCs and major 
human leukocyte antigen class I (HLA-I) pep- 
tidomics confirmed the stable expression of 
hundreds of noncanonical CDS peptides (Fig. 
1F and figs. S4 and S5). HLA-I peptidomics 
identified 240 noncanonical peptides, which 
suggests that these peptides enter the HLA-I 
presentation pathway and contribute to the 
antigen repertoire and possible immuno- 
genicity (Fig. IF) (30). HLA-I prediction analysis 
cross-validated strong binding (Kg < 50 mM, 
where Kj is the dissociation constant) of non- 
canonical CDS HLA-I peptides to their re- 
spective allotypes (fig. S6) (30). MS-based 
proteomics using tryptic digestion identified 
far fewer noncanonical peptides, which may 
be due to challenges in detecting the trypsin- 
digested products from short, noncanonical 
CDSs or possibly to more rapid turnover of 
these noncanonical peptides (fig. S7). 

To test whether translation of the non- 
canonical CDSs is important for cell growth 
and potentially yields functional peptides, we 
measured the growth phenotypes resulting 
from CRISPR-mediated ORF knockout in 
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pooled screens (26). We designed a Cas9 ORF 
single guide RNA (sgRNA) library to specifi- 
cally knock out thousands of the noncanonical 
CDSs identified by ribosome profiling (Fig. 2A 
and materials and methods) (31, 32), targeting 
1098 uORFs, 613 IncRNA CDSs, 352 extensions 


A 


of annotated coding regions, 283 start over- 
laps, and 7 downstream CDSs. We performed 
pooled Cas9 knockout screens in iPSC and 
K562 chronic myeloid leukemia cells express- 
ing Cas9 and the sgRNA library, akin to con- 
ventional pooled screens for essential proteins 


LINC00998_112757632_59aa 


(26, 31). We measured sgRNA abundance in 
the cell populations shortly after library trans- 
duction and after 10 additional population 
doublings by deep sequencing to quantify the 
fitness defect conferred by each sgRNA. We 
then calculated a phenotype score (y) and 
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Fig. 1. Ribosome profiling and MS reveal translation of unannotated 
CDSs. (A) ORF-RATER analysis of ribosome profiling data: 62% are previously 
annotated coding sequences, whereas 16% are variants of canonical coding 
sequences that share portions of the coding sequence and 22% are distinct 
from annotated coding sequences. The naming convention of the identified 
ORFs is shown on the right. (B) Start-codon usage of the identified CDSs. 
(C) Cumulative distribution of CDS length. For distinct CDSs, 96% are smaller 
than 100 amino acids. (D) Example ribosome profiling traces of a IncRNA 
peptide from LINCO0998 and a uORF peptide from ARLSA displaying the 
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Andromeda Score 
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hallmarks of translation, including peaks of density around the start codon 
following harringtonine treatment and three-nucleotide periodicities along the 
coding region. (E) Metagene analysis shows that the signatures of translation, 
including three-nucleotide periodicity in the expected reading frame, for 
uORFs and IncRNA CDSs are similar to those for annotated coding regions. 
(F) Identification of >200 noncanonical CDS peptides from HLA-I peptido- 
mics, cross-validating their existence across the whole abundance range, with 
a mean Andromeda score of 141 compared with a total mean Andromeda 
score of 144. See materials and methods for further details. 
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Fig. 2. Genome-scale CRISPR screens to identify functional, non- 
canonical CDSs. (A) Schematic of CRISPR library design and screening 
strategies, either by growth screens or Perturb-seq. For growth screens, 
frequencies of cells expressing a given sgRNA are determined by next- 
generation sequencing, and phenotype scores are quantified with the formula 
shown. For Perturb-seq, single-cell transcriptomes and sgRNA identities were 
obtained by single-cell RNA-seq. (B) Volcano plot summarizing knockout 
phenotypes and statistical significance (determined by Mann-Whitney U test) 
for ORFs targeted in the pooled screen in iPSCs. Each dot represents a 
targeted ORF, and ORF hits are labeled in purple, with a more negative 
phenotype score indicating a stronger growth defect. See materials and 


confidence (P value) for each ORF from the 
relative enrichment or depletion of ssRNAs 
targeting a particular ORF (Fig. 2B and ma- 
terials and methods). In iPSCs, our screen 
identified >500 ORF knockout hits that re- 
sulted in statistically significant phenotypes. 
The hits include 169 genes that are variants 
of annotated proteins, 78 start overlap hits, 
230 uORF hits, 91 IncRNA CDS hits, and 2 
downstream CDS hits. iPSC and K562 cells 
had 401 shared hits, suggesting housekeep- 
ing or general cellular roles as well as CDSs 
that may play cell-specific functions (fig. S8). 
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A fraction of the uORF hits do not have main, 
canonical CDSs with fitness defects upon 
knockout. This suggests an independent func- 
tion of the uORFs or that disruption of the 
uORFs leads to increases in main CDS ex- 
pression, which results in the growth phenotype 
(fig. SSE). Thus, unannotated CDSs with impor- 
tant functions across multiple cell types are an 
abundant feature of the genome. 

Several lines of evidence further suggested 
that our screen reported specifically on the 
phenotypes of the selected ORFs. First, the 


phenotypes of control sgRNAs targeted di- 
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methods for further details. (C) Plot of the sgRNA phenotypes and distance 
from the start codon across all ORF hits. sgRNAs targeting the genome 
immediately upstream of the ORF (shown in red) have significantly lower 
phenotype scores than sgRNAs targeting within the ORF (shown in blue). Note 
the axis is increasingly negative (stronger) phenotype. The sgRNA phenotypes 
are quantified by the boxplot to the right. The difference is not because 

of differences in sgRNA on-target efficiencies, as quantified by the Doench v2 
score. (D) The PhyloCSF score per codon (higher scores are 
across the Euarchontoglires) is generally higher for ORF hits (*P = 10°°°, 
Kolmogorov-Smirnov test) and ORFs with a stronger phenotype. Note that 
lack of a growth phenotype does not necessarily imply a low PhyloCSF score. 


more conserved 


rectly upstream of each ORF in the genome 
(Fig. 2C) are significantly weaker than those 
of sgRNAs targeted within the ORF (P = 10°”, 
Mann-Whitney test). Second, sgRNA phenotypes 
are independent of distance to other anno- 
tated proteins, splice sites, or transcriptional 
start sites (fig. SOA). Functionally, ORF hits are, 
on average, more phylogenetically conserved 
with a higher conservation score than non- 
hits (PhyloCSF score per codon, P = 10°7°, 
Kolmogorov-Smirnov test; Fig. 2D) (33), and 
they have other distinguishing sequence fea- 
tures (e.g., enrichment for Kozak consensus 
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Fig. 3. Short IncRNA CDSs encode functional microproteins. (A) Rescue 
of IncRNA CDS knockout growth phenotypes by ectopic expression of the 
transcript encoding the peptide, as well as controls in which the initiating 
start codon is removed (Astart codon). Error bars represent standard 
deviation of triplicates. P < 0.05 for all comparisons between knockout (KO) 
and KO + rescue. (B to D) Microscopy images and volcano plots of the co-IP 
MS of three example IncRNA-encoded microproteins tagged with mNG11, 
expressed ectopically (in the native transcript context) in a HEK293T cell line 


sequence) (fig. S10). However, the noncanonical 
CDSs, on average, have lower PhyloCSF scores 
compared with canonical proteins (fig. S2B). 
Finally, sgRNAs targeting ORF hits versus non- 
hits have indistinguishable off-target and on- 
target scores (fig. S9B) (32). We then performed 
validation follow-ups with individual sgRNAs, 
which recapitulated the growth phenotypes from 
our genome-scale screen (fig. S8D). Sequencing 
of the targeted genomic regions revealed insert- 
ions and deletions (indels) of <50 base pairs 
(bp) (fig. S9, C and D). Together, these analyses 
independently support the conclusion that our 
screen phenotypes result specifically from the 
disruption of the target ORFs. 

To survey function of the noncanonical 
CDSs at scale, we combined CRISPR screening 
with single-cell RNA sequencing (Perturb-seq) 
(34, 35). Disruptions of the various non- 
canonical CDSs resulted in broad and diverse 
changes in RNA-sequencing profiles across a 
variety of critical pathways, suggesting that 
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the candidate CDSs play diverse cellular roles 
(fig. S11). As an example, disruption of the 
CDS on LINCO0998 resulted in differentially 
expressed genes related to glycosylation (P < 
10°”°), suggesting a function at the Golgi or 
endoplasmic reticulum (ER). The transcrip- 
tional phenotype also allowed us to function- 
ally profile CDSs that are not essential for 
robust growth (fig. S11C). Furthermore, we 
found that CRISPR-targeted transcripts did 
not show detectable changes in abundance 
that might result from processes such as 
nonsense-mediated decay, which indicates 
that the phenotypes we observed were not 
caused by decreasing the abundance of the 
entire transcript (fig. S11D). Thus, similar to 
screens for essential protein-coding genes 
(26, 31), our screen for noncanonical CDSs 
required for robust cell growth underesti- 
mated the true number of functional CDSs 
in the genome. This finding further under- 
scores the pervasiveness of functional, un- 
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expressing mNG1-10. Green is mNG, red is the indicated organelle 
localization, and blue is Hoechst 33342, which stains for the nucleus. Scale 
bar dimensions are labeled. Statistically significant interactors are shown 

in the top, right corner of the volcano plots. Thick threshold line is 1% FDR 
(false discovery rate), and the thin threshold line is 5% FDR. The bait 

(the tagged peptide) is labeled in blue. The interactors are colored 
according to their functional groups. (E) IncRNA-encoded microproteins are 
uncharacterized proteins that may play important regulatory roles in cells. 


annotated CDSs in the genome that affect a 
wide range of cellular activities. 

We next explored the functional role of the 
peptides encoded by the noncanonical CDSs 
identified from our screen, focusing first on 
IncRNA CDSs. For seven IncRNAs, we ectopi- 
cally expressed the transcript encoding for the 
peptide and found, in all cases, that knockout- 
induced growth defect was partially or com- 
pletely rescued. This rescue was abrogated by 
the removal of the initiating start codon (Astart 
codon) (Fig. 3A), which suggests an essential 
role of the peptide itself in cell growth. To 
further investigate the specific functions of 
the noncanonical microproteins, we adopted 
a split-fluorescent protein approach using 
mNeonGreen (mNG), in which we fused each 
peptide with a minimally disruptive 16-aa tag 
(mNGI11). Coexpression of the tagged peptide 
with the remainder of the mNG protein 
(mNGI1-10) results in a fluorescence signal 
upon complementation (36, 37). This creates 
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interactors are shown in blue. (€ to E) Example volcano plots of 
co-IP MS in (B) reveal statistically significant interactors with UORF 
peptides. Threshold line is 1% FDR. The bait (the tagged peptide) is 
labeled in blue. For microscopy in (C) and (D), the main, canonical 
protein tagged with mCherry (red) is coexpressed. For (E), the mNG11- 
tagged MIEF1 uORF peptide (green) localizes to the mitochondria 
(red). (F) Volcano plot of co-IP MS from endogenously mNG11-tagged 
HAUS6 UORF. For microscopy, the mNGI1-tagged uORF is expressed 
alone ectopically (green), and the canonical HAUS6 tagged with 
mCherry (red) is coexpressed. (G) Percent change for each 
cell cycle state for HAUS6 knockout (KO) and HAUS6 UORF KO, 
compared with control cells. (H) Transcriptome response of the 
EF1 uORF KO compared with the main CDS KO from Perturb-seq. 
(I) Quantification of mitochondria morphology upon MIEF1 uORF 
peptide overexpression and knockout, as well as rescue of knockout 
phenotype. Representative microscopy images of the different 
mitochondria morphologies are shown to the right. (J) Possible 
model of uORF peptide functions and regulatory roles in cells. 
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both a fluorescent reporter to detect stable 
expression and cellular localization and a 
handle for coimmunoprecipitation (co-IP) 
and MS to define interaction partners (36) 
(fig. S12A). We probed the functions of six 
essential IncRNA CDSs and found that five 
of them formed specific complexes that were 
consistent with their subcellular localization. 
For example, the 62-aa peptide encoded by 
IncRNA RPI1_469AI15.2 specifically localized 
to the mitochondria. The peptide has a pre- 
dicted transmembrane domain and coimmu- 
noprecipitates with the cytochrome c oxidase 
(COX) complex and the mitochondrial Pro- 
hibitin complex (Fig. 3B). Moreover, the 70-aa 
peptide encoded by RPII-84A1.3 localizes to the 
plasma membrane and interacts with various 
cell surface proteins (Fig. 3C). Third, the 59-aa 
peptide encoded by IncRNA LINC00998, which 
contains two predicted transmembrane do- 
mains, localizes specifically to both the ER and 
Golgi and coimmunoprecipitates with lyso- 
somal and vesicular transport proteins (Fig. 
3D). Finally, the 55-aa peptide encoded on 
TOPORS-AS] and the 124-aa peptide on RPII- 
132A1.4 also form functional complexes con- 
sistent with their cellular localization (fig. $12, 
C and D, and fig. S13). Consistent with prior 
studies (5-18), these examples demonstrate 
that IncRNAs can encode uncharacterized 
proteins, and they highlight the need to fully 
extend the annotation of IncRNAs and the 
proteome. 

We next explored the functional effects of 
uORF translation, which is complicated by 
the fact that phenotypes can, in principle, be 
mediated by the peptide product (24, 38-41), 
the effect of uORF translation on expression 
of the main, canonical CDS (20), or both. To 
distinguish between these possibilities, we 
first separately tagged the uORF and the 
main CDS and used Western blot to confirm 
the independent expression of uORF pep- 
tides from the canonical protein (fig. S14). 
Furthermore, we established that ectopic ex- 
pression of a transcript encoding only the 
uORF peptide could at least partially rescue 
the growth phenotype caused by disruption 
of the endogenous uORF. In all cases this 
rescue is dependent on the initiating start 
codon in the ectopically expressed message, 
which demonstrates that the rescue is the re- 
sult of production of the expressed peptide 
(Fig. 4A). Consistent with this, in all cases we 
tested, deleting the start codon for the uORFs 
only minimally increased (~20% to 60%) the 
expression of the main CDS. This suggests that 
the growth defect observed is mediated by 
the peptide and not by increased expression 
of the canonical protein (fig. S14, E and F). 
Taken together, these findings establish that 
uORFs could function through the peptide 
they produce independently of any cis-regulatory 
effects. 
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To explore the functions of uUORF-encoded 
microproteins, we examined their localization 
and protein binding partners by tagging the 
uORF peptides with mNGI11. Out of the 10 ORF 
peptides further tested by co-IP MS, we failed 
to detect statistically significant interaction 
partners for three of the tagged peptides. Two 
peptides, encoded by the uORFs of TBPL] and 
ARLSA, localize generally to the cytoplasm, 
whereas the main CDS proteins exhibit different 
cellular localization patterns. Consistent with 
our observed cellular localizations, these two 
uORF peptides specifically immunoprecipitate 
proteins with functions that are independent 
of the main CDS protein (Fig. 4, B and C, and 
fig. S12). Thus, these uORF peptides and their 
main CDS protein have independent functions. 

We found that 5 of the 10 uORF peptides 
colocalized and formed a stable physical com- 
plex with the downstream-encoded, canonical 
protein on their shared mRNA. These include 
MIEF1, DDIT3, FBXO9, HMGA2, and HAUS6 
(Fig. 4, B, D, and E, and fig. S12). In all cases, 
we expressed the tagged peptides in their na- 
tive transcript context but without the down- 
stream CDS, thereby eliminating the possibility 
of stop codon read-through. We further con- 
firmed this interaction by co-IP of the canon- 
ical protein and immunoblotting for the uORF 
peptide (fig. SI2F), as well as with endoge- 
nously tagged clonal lines (fig. S15 and Fig. 4F). 
This physical interaction between the proteins 
encoded by the uORF and the canonical CDS 
on the same transcript is notable (39, 42, 43) 
because it implies an additional layer of regu- 
lation beyond the propensity of UORFs to mod- 
ulate translation of downstream CDSs. 

Next, we further investigated the function of 
uORF-expressed microproteins in HAUS6 and 
MIEF'1. In both cases, disrupting the uORF led 
to minimal increase in the expression of the 
main CDS protein, and the ectopic expression 
of a peptide-encoding transcript rescued the 
knockout-induced growth phenotype (Fig. 4A 
and fig. S14). mNG11-tagged HAUS6 uORF ex- 
pressed from its endogenous locus efficiently 
pulled down key components of the HAUS6 
complex, localized to the centrosome, and 
knockout of the uORF caused cells to arrest 
in the G1 stage, consistent with the role of 
HAUS6 microtubule attachment to the kine- 
tochore and central spindle formation (Fig. 4, 
F and G, fig. S12, and fig. S15). Similarly, the 
MIEF1 uORF peptide localized to the mitochon- 
dria, consistent with the localization of the 
MIEF' protein (Fig. 4E), which regulates mito- 
chondrial fission and fusion (44). The MIEF1 
uORF peptide knockout induced differential 
expression of mitochondrial fusion and fission 
genes, with a transcriptional signature that 
was distinct from that seen in the knockout 
of the MIEF1 protein (Fig. 4H). We observed 
that overexpression of the MIEF1 uORF pep- 
tide alone induced a fragmented mitochon- 
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drial phenotype (increased fission), whereas 
a clonal knockout of the MIEF1 uORF (with 
the sequence disrupted but nonetheless pre- 
serving an upstream ORF; see fig. S15) resulted 
in a tubular and more elongated mitochon- 
drial phenotype (increased fusion). Notably, 
this knockout morphology could be rescued 
by the exogenous expression of the MIEF1 
uORF peptide (Fig. 41). Together, our results 
indicated a possible role of the uORF-encoded 
peptide in regulating the downstream-encoded 
protein, thereby challenging the monocistronic 
assumption about mammalian genomes. We 
speculate that this type of genomic architecture 
may be general, opening the doors to investi- 
gation of the cooperative and regulatory nature 
of bicistronic human mRNAs. Indeed, a num- 
ber of stress-regulated alternate translation 
initiation factors can modulate translation 
initiation site choice and uORF usage, which 
suggests that regulation of bicistronic expres- 
sion could play roles in both normal biology 
and diseases states (21, 45). 

We described a strategy that combines 
ribosome profiling, MS-based proteomics, 
microscopy, and CRISPR-based genetic screens 
to discover and characterize widespread trans- 
lation of functional microproteins and define 
the protein-coding potential of complex ge- 
nomes. We identified a subset of ncRNAs that 
can encode stable, functional proteins, which 
suggests that they may be misannotated RNAs 
or potentially have dual roles at the RNA and 
protein levels. Furthermore, we provided ex- 
amples of uORFs encoding functional peptides, 
highlighting the diverse cellular roles that 
uORFs may play beyond translational control. 
We also identified uORF-encoded peptides 
binding to the downstream-encoded protein 
on the same mRNA. Thus, our data highlight 
a previously unappreciated complexity of the 
functional mammalian proteome, as well as 
the full spectrum of antigens presented by the 
HLA system. 
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PE/PPE proteins mediate nutrient transport across 
the outer membrane of Mycobacterium tuberculosis 


Qinglan Wang?, Helena I. M. Boshoff?, Justin R. Harrison’, Peter C. Ray”*, Simon R. Green’, 


Paul G. Wyatt?, Clifton E. Barry III+4* 


Mycobacterium tuberculosis has an unusual outer membrane that lacks canonical porin proteins for the 
transport of small solutes to the periplasm. We discovered that 3,3-bis-di(methylsulfonyl)propionamide 
(3bMP1) inhibits the growth of M. tuberculosis, and resistance to this compound is conferred by mutation 
within a member of the proline-proline-glutamate (PPE) family, PPE51. Deletion of PPE51 rendered 

M. tuberculosis cells unable to replicate on propionamide, glucose, or glycerol. Growth was restored 
upon loss of the mycobacterial cell wall component phthiocerol dimycocerosate. Mutants in other 
proline-glutamate (PE)/PPE clusters, responsive to magnesium and phosphate, also showed a 
phthiocerol dimycocerosate—dependent growth compromise upon limitation of the corresponding 
substrate. Phthiocerol dimycocerosate determined the low permeability of the mycobacterial outer 
membrane, and the PE/PPE proteins apparently act as solute-specific channels. 


ycobacterium tuberculosis thrives in 
the hostile environment of the human 
lung despite an onslaught of host re- 
sponses that prevent the growth of near- 
ly all other bacteria (7). M. tuberculosis 
survives because it is covered by a highly im- 
permeable envelope and is able to manipulate 
the host defense mechanisms (2). Outside of the 
cytoplasmic membrane is the mycobacterial 
envelope, which consists of a complex hetero- 
polymer composed of peptidoglycan covalent- 
ly attached to arabinogalactan terminated by 
mycolic acids, which are specific to myco- 
bacteria (3). Cryo-electron microscopy shows 
that the outer lipid layer of the mycobacterial 
outer membrane is ~8 to 9 nm in width (4, 5). 
No canonical porin protein has been identi- 
fied in slow-growing mycobacteria, such as 
M. tuberculosis, although a major porin (MspA) 
has been identified in fast-growing mycobac- 
terial species, such as M. smegmatis (6). How- 
ever, porin-like activity has been observed in 
membrane preparations from slow-growing 
mycobacteria (7, 8), but no protein determinants 
of this activity have been identified. Express- 
ion of the porin from fast-growing mycobac- 
teria in M. tuberculosis substantially reduces 
virulence and enhances antibiotic suscepti- 
bility (9, 10). A canonical porin is incompatible 
with the need for a permeability barrier that is 
extreme enough to allow the intracellular sur- 
vival of M. tuberculosis. 
As part of our drug discovery efforts, we 
identified a small-molecule whole-cell inhibi- 
tor from a library of agrichemicals. This simple 
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molecule, 3,3-bis-di(methylsulfonyl)propionamide 
(8bMP1), displayed a modest minimum inhibitory 
concentration (MIC) of ~3 ug/ml independ- 
ent of growth media and was highly bacteri- 
cidal, reducing starting colony forming units 
(CFU) by 3 logs after 4 days and achieving 
near-sterilization after 8 days of incubation 
(Fig. 1, A and B). To identify the mechanism of 


A 


action of this compound, we selected for resist- 
ant mutants by plating wild-type M. tuberculosis 
cells on media containing 10-fold the MIC of 
3bMPI, and we observed mutants growing 
after 5 weeks of incubation. The resistance of 
these mutants was confirmed in liquid cul- 
tures, showing an MIC of 25 ug/ml in 7H9/OAD 
medium containing 0.05% Tween 80 [a deter- 
gent typically added to tuberculosis (TB) cul- 
tures to disperse the bacteria] and a much 
higher MIC in the same medium without Tween 
80 (Fig. 1, C and D). Whole-genome sequencing 
of 10 clones from this experiment revealed that 
all had scattered mutations in a single pro- 
tein, proline-proline-glutamate (PPE) family 
protein 51 (PPE51) (fig. SIA). Complementation 
of these mutants with a wild-type copy of the 
ppesl gene fully restored sensitivity to 3p>MP1 
(Fig. 1, C and D), which is consistent with what 
were apparently loss-of-function mutations in 
ppesl (fig. SIA). 

The PE/PPE family of proteins, named for 
the homologous proline-glutamate (PE) or 
proline-proline-glutamate (PPE) repeated 
regions in their N terminus, comprises 169 or- 
thologs. These proteins represent nearly 10% 
of the M. tuberculosis genome, and many are 
substrates for the type VII secretion systems of 
M. tuberculosis (11). Because they are secreted 
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Fig. 1. Loss-of-function mutations in ppe51 result in the resistance to compound 3bMPL1. (A) The chemical 
structure of 3bMP1. (B) Logarithmically growing M. tuberculosis cells were exposed for 4 and 8 days to 3bMP1 

at 1x, 5x, and 10x MIC values. Rifampicin (Rif) and dimethyl sulfoxide (DMSO) were used as positive and 

negative controls, respectively. Data are generated from two independent experiments and shown as mean + SD 
(*P < 0.05, **P < 0.01, versus DMSO control, unpaired t test). (© and D) Susceptibility of M. tuberculosis strains to 
SbMP1 in 7H9/OAD media with (C) or without (D) 0.05% Tween 80. Bacterial growth was quantified using an 
alamarBlue-based assay. R1 contains a G18D (Gly"®—Asp) mutation in the PPE51. R2 contains insertion sequence 
IS6110 at codon 247 in the ppe51 gene. Data are representative of two independent experiments, both done as 
technical duplicates, and error bars represent SD [in (C): P = 0.0007, wild type (wt) versus R1; P = 0.0107, wt versus 
R2; no significant difference among wt, Rl+ppe51, and R2+ppe5l; in (D): P < 0.0001, wt versus R1; P < 0.0001, 
wt versus R2; no significant difference among wt, Rl+ppe51, and R2+ppe51; data were analyzed using two-way 
analysis of variance (ANOVA) of matched values]. 
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and numerous, they are commonly thought to 
play a role in interacting with the human im- 
mune system (12). However, these proteins do 
not appear to show sequence variation driven 
by immune recognition (13). The PE/PPE pro- 
teins are clearly associated with the outermost 
layer of the mycobacterial cell envelope (14-16). 
Many PE/PPE proteins have been implicated 
in various aspects of the pathogenesis of TB 
disease, but their molecular mechanisms have 
not been identified (77). Two recent studies 
have linked individual PPE proteins to heme 
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and glycerol acquisition, yet the biological func- 
tion of many of these proteins remains elusive 
(16, 18). 

Studies on analogs of 3bMP1 revealed that 
the propionamide core was essential for anti- 
tubercular activity (table S1). To understand 
whether resistance resulted from restricting 
drug uptake, we cultured M. tuberculosis in 
media containing only propionamide, a sub- 
structure of 3bMPI, as a carbon source. As ex- 
pected, given the metabolic plasticity of this 
organism for growth on lipid substrates (19), 
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wild-type M. tuberculosis was capable of florid 
growth on propionamide. The PPE51 mutant 
resistant to 3bMP1 was, however, incapable of 
growth on propionamide (fig. S1B). In con- 
trast, the longer lipid substrate hexanamide 
supported the growth of both strains (fig. SIC). 
These results suggested that the resistance of 
this mutant might be due to impaired uptake 
of 3bMP1. We also screened this mutant for 
growth on other carbon sources and found 
that the mutant was also unable to optimally 
use glycerol and glucose as its sole carbon 
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Fig. 2. PPE51 is required for uptake of glycerol and glucose by M. tuberculosis. 
(A and B) Growth of M. tuberculosis strains on 0.2% glycerol (A) or 0.2% glucose 
(B) as sole carbon source. Data are from three independent experiments and 
represent mean + SD. (C) Thin-layer chromatographic analysis of PDIM production by 
M. tuberculosis strains. An M. tuberculosis clinical isolate, HN878, was included as a 
control. OD¢oo, optical density at a wavelength of 600 nm. (D and E) [“C]-glycerol and 
[“C]-glucose uptake by M. tuberculosis strains. Uptake experiments were done as 
two independent biological replicates, and mean values are shown with SD [in (D): 
#P < 0.001, wt versus Appe5l at different time points; *P < 0.05, Appe5! versus 
Appe51+mspA at the last time point; in (E): *P < 0.05, **P < 0.01, ns = not 
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significant, wt versus Appe51 at different time points; *P < 0.05, Appe51 versus 
Appe5l+mspA at the last time point, unpaired t test]. (F) Western blot analysis of 
whole-cell lysates (WCL), water-soluble supernatant (SN), membrane-associated 
pellet (P), and culture filtrate fractions (CF) obtained by ultracentrifugation of 

M. tuberculosis—expressing His-tagged PPES1. GroEL (cytosol) ESAT-6 and CFP-10 
(secreted) and PrrB (inner membrane) were used as fractionation controls. KD, 
kilodaltons; ESAT-6, early secreted antigen of tuberculosis 6 kDa. (G) Cell surface 
protein biotinylation. Proteins of known subcellular localization served as control for 
cytosolic proteins (GroEL), inner membrane-associated proteins (PrrB), and surface- 
accessible proteins (spmT). (H) Cell surface accessibility of PPE51 by flow cytometry. 
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sources (fig. S1, D and E). Uptake assays for 
4C-glycerol and “C-glucose showed that the 
mutant was deficient because it could not 
transport glycerol and glucose, whereas both 
the parent and the complemented strain could 
(fig. S1, F and G). 

To confirm that loss of PPE51 function was 
responsible for these phenotypes, we created a 
site-specific deletion of the ppe5/ gene (fig. S2A). 
This knockout did not recapitulate the pheno- 
type of the drug-selected mutant—it showed 
less resistance to 3bMP!1 in the medium with- 
out Tween 80 and was able to use glycerol and 
glucose as its sole carbon sources, unlike the 
drug-selected mutant (fig. $2, B and C, and 
Fig. 2, A and B). After further analysis, we 
discovered that the mutant had lost the ability 
to synthesize an abundant cell wall-associated 
nonpolar wax phthiocerol dimycocerosate 
(PDIM), containing a single nucleotide dele- 
tion (618delA) in the fadD26 gene, which en- 
codes a fatty acid-coenzyme A ligase in the 
PDIM biosynthesis pathway (Fig. 2C and fig. 
S1H). PDIM loss is a common occurrence 
during in vitro culture of M. tuberculosis (20). 
PDIM makes up 46% of the total lipids of 
M. tuberculosis, amounting to 11% of the total 
weight of dried bacteria (27). When we reselected 
a PDIM-positive deletion mutant of ppe5/, the 
deletion mutant displayed a level of resistance 
to 3bMP1 similar to that of the drug-selected 
mutant, as well as an inability to grow on 
glycerol and glucose (fig. S2, B and C, and Fig. 
2, A and B). No obvious growth difference 
was observed in standard rich medium or in 
media containing oleic acid or hexanamide 


Fig. 3. PPE51 interacts with PE19 in M. tuberculosis. 
(A) Loss of pel9 in M. tuberculosis leads to resistance 
to 3bMPI1. (B) The pel9 gene is negatively regulated by 
the antisense expression of pel8. Antisense expression of 
pel8 was induced using 100 ng/ml anhydrotetracycline 
(aTC). In (A) and (B), experiments were performed in 
7H9/0AD medium containing 0.02% tyloxapol. Data are 
generated from at least two independent experiments, 
both done as technical duplicates, and error bars 
represent SD [in (A): P < 0.0001, wt versus Ape19; in (B): 
P = 0.0008, wt versus wttpel8anti (+aTC); P = 0.0082, 
wt+pel8anti (-aTC) versus wt+pel8anti (+aTC); two- 
way ANOVA of matched values]. (C) In vivo interaction 
between PPE51 and PE19. M. tuberculosis coexpressing 
His-tagged PPE51 and HA-tagged PE19 (or PE25) were 
cross-linked by DSP. PPE51 was precipitated using 
nickel-nitrilotriacetic acid resin, and coprecipitated PE 
proteins were visualized by Western blot (WB) using 
anti-HA antibodies. DTT, dithiothreitol; IP, immuno- 
precipitation. (D) Export of PPES1 in AppeS! and 
Appe25-pel9 mutants. The Apped51 and Appe25-pel9 
cells expressing His-tagged PPE51 were grown in 
detergent-free Sauton’s medium. Surface proteins 
were extracted by 1% n-octyl-B-p-glucopyranoside 
(OBG). The presence of PPES1 in WCL, OBG extracts, 


as sole carbon sources, indicating that the 
PDIM-positive pped5I knockout mutant did 
not have a general growth defect (fig. S2, D 
to F). The PDIM-positive pped5I deletion 
mutant also showed a defect in uptake of 
C-labeled glycerol and glucose (Fig. 2, D 
and E). Restoration of PDIM production in 
the original ppe5I deletion mutant was 
achieved by providing a wild-type copy of 
fadD26, which resulted in an inability to 
grow on glycerol and glucose and showed 
defective uptake of “C-labeled glycerol and 
glucose (Fig. 2, A to E). Notably, both growth 
and uptake defects in the PDIM-positive ppe51 
knockout mutant were complemented by ex- 
pression of wild-type ppe5I or partially com- 
plemented by expression of the porin gene 
mspA of M. smegmatis (Fig. 2, A to E). Together 
with the observation that loss of PDIM re- 
scued growth and uptake defects, this result 
indicates that PPE51 was localized to the outer 
membrane of M. tuberculosis and had a simi- 
lar channel function to MspA. 

Subcellular fractionation of M. tuberculosis 
cells expressing His-tagged PPE51 showed 
that this protein was membrane-associated 
(Fig. 2F). To distinguish between the inner and 
outer membrane proteins, we performed a 
cell-surface protein biotinylation assay—which is 
based on surface detection of proteins in whole 
cells using antibodies—using a membrane- 
impermeable biotinylation reagent (22) and 
flow cytometry analyses. As shown in Fig. 2, 
G and H, His-tagged PPE51 was biotinylated 
with amine-PEGI1-biotin and was detected 
with a monoclonal anti-His antibody in flow 


cytometry experiments, whereas an inner mem- 
brane protein (PrrB), an inner membrane- 
associated protein (MbtG), and a cytosolic 
protein (GroEL) were not detected. The func- 
tionality of His-tagged PPE51 was confirmed 
by showing that it restored sensitivity to 3bMP1 
(fig. S2G). These experiments indicate that 
PPE51 is localized to the outer membrane of 
M. tuberculosis. 

The PE/PPE proteins are thought to assem- 
ble as heterodimers. The first reported crys- 
tallographic structure of a PE and PPE protein 
heterodimer (PE25 and PPE41) revealed that 
the proteins are tightly associated via hydro- 
phobic regions along the long axis of a four-helix 
bundle (23). The similarity of this heterodimer 
to the structure of an ESAT-6 secretion system 1 
(ESX-1)-secreted protein in M. tuberculosis, 
EspB, has been invoked to explain the substrate 
specificity of the type VII secretion system, 
which also transports PE/PPE proteins across 
the cytoplasmic membrane (24). EspB oligo- 
merizes into heptamers that form pores in the 
phagosomal membrane of the host cell during 
infection by M. tuberculosis. Our data were con- 
sistent with a similar porin-like role for PPE51, 
in combination with a PE protein, embedded 
in the PDIM (fig. S4). 

To identify the partner PE protein for 
PPE51, we performed saturating transposon 
mutagenesis on wild-type M. tuberculosis, se- 
lecting for resistance to 3bMPI1. In addition 
to hits in ppe51, we identified transposon hits 
in pel9 that conferred resistance (table S2). 
We subsequently constructed a series of mu- 
tants lacking each of the individual genes in 


and culture filtrate (CF) was examined by anti-His antibodies. 
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Fig. 4. PE20/PPE31 and 
PE32/PPE65 are required 
by M. tuberculosis for 
growth during Mg2* and 
PO;~ restriction. Growth of 
M. tuberculosis strains in 
modified detergent-free 
Sauton'’s medium containing 
10 uM Mg?* at pH 7.0 (A), 
pH 6.5 (B), and pH 6.2 (C). 
Media were buffered 

with 50 mM MOPS [3-(N- 
morpholino)propanesulfonic 
acid]. After growth in 

24-well plates for 15 days 

at 37°C, cells were treated 
with 0.1% Tween 80 over- 
night to prepare homoge- 
neous cell suspensions, and 
cell growth was quantified 
by alamarBlue assay. Data 
are from technical triplicates 
corresponding to fig. S6 o 
(*P < 0.05, **P < 0.01, 

***P < 0.001, #P < 0.0001, 
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unpaired t test). (D) M. tuberculosis requires ppe25-pel9 and pe32-ppe65 for efficient growth during phosphate limitation. Growth of M. tuberculosis strains in 
modified Sauton’s medium containing 30 uM Pi buffered with 50 mM MOPS. A final concentration of 100 ng/ml aTC was used to knock down the expression of 
pe32-ppe65 in the ppe25-pel9 mutant. Data are generated from three independent experiments and shown as mean + SD. RFU, relative fluorescence units. 


this locus, as well as the entire ppe25-pe19 
locus, noting the high sequence identity be- 
tween ppe25 and ppe27 genes (86.2%) and 
pel8 and pel9 genes (78.4%) in this locus 
(fig. S3, A and C). As shown in Fig. 3A, only 
the Appe25-pel9 and Apel9 mutants showed 
resistance to 3bMP1. The growth defect of 
the Appe25-pel9 mutant was also rescued by 
loss of PDIM (fig. S3, B and D). PEI8 and 
PE19 are highly conserved (89% identity), 
and extra-chromosomal expression of either 
PE18 or PE19 restored the susceptibility of 
the Appe25-pel9 mutant to 3bMP!1 and re- 
scued the growth of the Appe25-pel9 mu- 
tant on glycerol (fig. S3D and table S3). It 
is known that pel8 produces antisense trans- 
cripts naturally (25), indicating that the ex- 
pression of PE19 is negatively regulated by 
the antisense expression of pel8. Overex- 
pression of the antisense transcript of peZ8 in 
wild-type H37Rv caused resistance to 3bMP1 
(Fig. 3B), which is consistent with this regu- 
latory role. To confirm the interaction be- 
tween PPE51 and PE19 biochemically, we 
coexpressed His-tagged PPE51 and HA-tagged 
PE19 in M. tuberculosis cells and performed 
a coimmunoprecipitation assay with the 
cross-linker dithiobis(succinimidyl propionate) 
(DSP). As expected, PE19 coprecipitated with 
PPE51, and PE25 showed only slight copre- 
cipitation with PPE51, likely because the 
N-terminal half of PE25 is nearly 50% iden- 
tical to PE19 (Fig. 3C). Because PE/PPE pro- 
teins are usually exported as pairs by ESX 
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systems, we examined the export of PPE51 
in the ApeJ9 mutant and found that export 
of PPE51 was blocked by the deletion of pel9 
(Fig. 3D). 

We next asked whether this porin-like func- 
tion was specific for the PE19/PPE51 pair or if 
other PE/PPE pairs had similar functions. The 
genomic locus encoding PE20-PPE33 has been 
shown to be up-regulated during Mg?” limita- 
tion. This locus clustered with the mgtC gene, 
which is known to be involved in virulence and 
is also up-regulated upon magnesium depriva- 
tion (26). Furthermore, a magnesium riboswitch 
(Mbox) was found upstream of the pe20-ppe33 
locus (27), indicating that these proteins play a 
role in magnesium homeostasis. We therefore 
generated Ape20 and Appe3I-ppe33 mutants 
in M. tuberculosis in both a PDIM-positive and 
a PDIM-negative background (fig. S5, A and B, 
and fig. S8A). As expected, both PDIM-positive 
mutants showed a growth defect in Mg?*- 
limiting media, especially at mildly acidic pH, 
while the PDIM-negative mutants grew sim- 
ilarly to wild type (Fig. 4, A to C, and figs. S6, 
87, and S8, D to F). The phenotype of Appe3!I- 
ppe33 could be complemented by express- 
ing ppe31-ppe33, ppe3l alone, or mspA from 
M. smegmatis, but not by expressing ppe32 
or ppe33 (Fig. 4, A to C, and fig. S6). PE20 
and PPE31 were also found to associate by 
coprecipitation (fig. S8B). Together, these re- 
sults suggest that PE20/PPE31 form a com- 
plex, possibly with other proteins, to mediate 
Me** transport across the outer membrane. 
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PE20 was also found to interact with PPE33 
(fig. S8B); however, PPE33 did not affect Mg”* 
uptake (Fig. 4, A to C, and fig. S6). Further 
studies will be needed to identify the substrate 
transported by PE20/PPE33. 

Finally, the genomic loci ppe25-pel9 and 
pe32-ppe65 are known to be strongly up- 
regulated by starvation for inorganic phos- 
phate (28). First, we tested the growth of a 
Appe25-pe19 mutant in defined media con- 
taining various concentrations of phosphate. 
The Appe25-pel19 mutants were impaired 
slightly for growth at 100 uM phosphate, and 
at 30 uM, growth was severely restricted (fig. 
S9). We were unable to obtain a single knock- 
out of the pe32-ppe65 locus or the double 
knockout of the pe32-ppe65 locus in the 
Appe25-pel9 background that retained PDIM. 
Therefore, we knocked down the expression 
of pe32 or ppe6os in the Appe25-pel9 mutant 
using CRISPR interference. As shown in Fig. 
4D and fig. S10, silencing the expression of 
pe32-ppe6s in Appe25-pel9 resulted in a pro- 
nounced growth defect in phosphate-limiting 
media, whereas the PDIM-negative Appe25- 
pel9/Ape32-ppe65 double knockout mutant 
grew similarly to the wild type. 

In slow-growing M. marinum, the ESX-5 
secretion system is essential for PDIM outer- 
membrane permeability, but the substrates 
it transports were not identified (29). The 
ESX-5 secretion system maps closely to the 
ppe25-pel9 gene cluster, and it seems likely 
that the observed phenotype was the result of 
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a lack of transport of PE19. The phylogeny and 
distribution of the PE/PPE proteins is such 
that all slow-growing mycobacteria seem to 
have an expanded repertoire of heterodimer 
combinations, and these species lack any de- 
tectable homolog of the porin seen in the fast- 
growing mycobacteria that have few PPEs 
(table S4). Our data, coupled with the dis- 
tribution of PE/PPE among the mycobacteria, 
suggest that slow-growing mycobacteria have 
co-opted at least some of these proteins as 
small molecule-selective channels in place of 
porins. The need for these porins is intimately 
associated with the production of PDIM. De- 
spite the fact that PDIM accounts for as 
much as 46% of the total lipids present in 
M. tuberculosis and for 11% of the total bac- 
terial mass in culture (27), its molecular role in 
the envelope has been unclear. PDIM, like the 
PE/PPE proteins, is also restricted in distribu- 
tion to the slow-growing mycobacteria, and 
our results indicate that its role is to enhance 
the impermeability of the mycobacterial cell 
wall. Fully 1% of the coding capacity of the 
organism is devoted to the biosynthesis of 
PDIM, and PDIM-deficient mutants are highly 
attenuated in virulence (30, 37). PDIM is easily 
lost during in vitro culture yet never absent 
from freshly isolated clinical strains (20). Our 
results suggest that PDIM acts within the 
outer leaflet of the mycobacterial cell envelope 
to enhance its impermeability, presumably 
thereby directly allowing growth in the hostile 
environment of the host. At least some of the 
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PE/PPE proteins appear to act as solute- 
selective pores, allowing M. tuberculosis access 
to nutrients required to proliferate in this 
environment. Understanding the transport 
properties and selectivity of these particular 
channels would enable rational approaches to 
optimize the uptake properties of newly de- 
signed drugs. 
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Sequencing metabolically labeled transcripts in 
single cells reveals mRNA turnover strategies 


Nico Battich*, Joep Beumer, Buys de Barbanson, Lenno Krenning}, Chloé S. Baront, 
Marvin E. Tanenbaum, Hans Clevers, Alexander van Oudenaarden* 


The regulation of messenger RNA levels in mammalian cells can be achieved by the modulation of 
synthesis and degradation rates. Metabolic RNA-labeling experiments in bulk have quantified these rates 
using relatively homogeneous cell populations. However, to determine these rates during complex 
dynamical processes, for instance during cellular differentiation, single-cell resolution is required. 
Therefore, we developed a method that simultaneously quantifies metabolically labeled and preexisting 
unlabeled transcripts in thousands of individual cells. We determined synthesis and degradation rates 
during the cell cycle and during differentiation of intestinal stem cells, revealing major regulatory 
strategies. These strategies have distinct consequences for controlling the dynamic range and 
precision of gene expression. These findings advance our understanding of how individual cells in 
heterogeneous populations shape their gene expression dynamics. 


ammalian cells use diverse strategies 

to regulate mRNA levels by controlling 

their synthesis and degradation rates 

(, 2). High synthesis or degradation 

rates allow cells to rapidly respond to 
extracellular and intracellular signals (2, 3), 
whereas low degradation rates allow them 
to integrate transcriptional information over 
time (4). The extent to which mammalian cells 
exploit different regulatory strategies during 
complex dynamical processes such as cell cycle 
progression or organ formation remains un- 
clear. This is partially due to the difficulty in 
distinguishing these strategies when only tran- 
script levels are measured. 

Additionally, the study of these regulatory 
strategies in bulk assays is hindered by the 
presence of heterogeneous cell types in the 
same tissue and unsynchronized cell states 
that result from the cell and circadian cycles 
(, 5-7). Advances in single-cell RNA sequenc- 
ing help to resolve cellular heterogeneity 
(8-14), yet do not provide insights into how 
the dynamic control of transcription and deg- 
radation leads to the observed expression pat- 
terns. The kinetic parameters that govern the 
life of mRNA can be measured by its metabolic 
labeling during transcription (J, 5, 6). Here, we 
demonstrate that mRNA labeled with 5-ethynyl- 
uridine (EU) can be detected in thousands of 
single cells by sequencing. We determined 
transcription and degradation rates in heter- 
ogeneous and unsynchronized cell popula- 
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tions and uncovered mRNA control strategies 
during the cell cycle of human cells and differ- 
entiation of mouse intestinal stem cells. 

To measure newly synthesized transcripts 
in single cells, we labeled mRNA by incubat- 
ing cells with EU, an analog of uridine that 
can be biotinylated with click chemistry, a 
method we have named “single-cell EU-labeled 
RNA sequencing” (ScEU-seq) (75). Briefly, after 
EU incubation, cells were dissociated, fixed, 
and permeabilized, and EU-labeled RNA was 
biotinylated in situ. We sorted single cells and 
generated mRNA/cDNA hybrids using poly-T 
primers containing a cell barcode, a unique 
molecular identifier (UMI), a 5’ sequencing 
adapter, and the T7 promoter. Cells were pooled, 
EU labeled and unlabeled hybrids were sep- 
arated using streptavidin magnetic beads, and 
libraries were generated for both fractions 
(Fig. 1A). The UMI counts for labeled mRNA 
were higher in EU-treated cells compared 
with dimethyl sulfoxide (DMSO)-treated cells 
or empty control wells, resulting in a high 
signal-to-noise ratio and low across-well cross- 
contamination rates (Fig. 1B). Only 12 of 11,848 
detected genes were affected by the EU treat- 
ment itself (fig. SIA). When we compared the 
total mRNA (unlabeled and EU-labeled UMIs; 
total UMIs) from EU-treated versus DMSO- 
treated RPE1-FUCCI cells, we found high re- 
covery efficiency (99.5 + 0.4%) of labeled MRNA 
(fig. S1B). After 120 min of EU incubation, the 
labeled mRNA fraction was on average 8.9 + 
0.7% (fig. SIB), which agrees with an expected 
average production of 8 to 10% of the tran- 
scriptome during a period of 2 hours in un- 
synchronized cells with a cell cycle length of 
~20 to 24 hours (16). 

To assess whether scEU-seq specifically en- 
riches transcripts synthesized during the EU- 
labeling window, we performed pulse and chase 
experiments varying either the EU incubation 
time or the length of a chase phase with uridine 
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(U) after EU treatment for 22 hours (Fig. 1C and 
fig. S1, C to G). As expected, we detected an in- 
crease in labeled UMIs as a function of the EU 
pulse length (Fig. 1D) and a decrease as a func- 
tion of the U chase length (Fig. 1E). We could 
still detect significantly higher UMI counts for 
very short labeling times (15 and 30 min) com- 
pared with the DMSO control (Fig. 1D and fig. 
S1G). In these short EU pulses, we found that 
labeled UMIs were enriched in transcripts that 
contained unspliced introns (Fig. 1F). 

Next, we incubated K562 cells at either 37°C 
or 42°C for a period of 45 min in the presence 
of EU or DMSO. The differential gene expres- 
sion signature upon heat shock was more 
pronounced in the fraction of EU-labeled 
mRNAs compared with the unlabeled frac- 
tion or with cells treated with DMSO (fig. S2). 
Consistently, the functional annotation anal- 
ysis (17) for up-regulated genes in the EU- 
labeled fraction revealed an enrichment for 
genes encoding heat shock or stress response 
proteins (fig. S2B). In addition, UMIs of these 
stress response genes represented a large per- 
centage in the EU-labeled samples but not in 
the DMSO-treated or unlabeled controls. 

Using data from the scEU-seq pulse and 
chase experiments, we can estimate the synthe- 
sis rate « and the degradation rate constant y 
for all detected transcripts. Furthermore, we 
can place individual cells along cell cycle or 
differentiation trajectories and thus infer how 
synthesis and degradation rates change over 
time. We first estimated « and y with high res- 
olution along the mammalian cell cycle. For 
each of the 5422 cells that passed quality con- 
trols in the pulse and chase experiments, we 
calculated the relative position along the cell 
cycle using the Geminin-GFP and the Cdt1- 
RFP signals from the FUCCI system (fig. S3, A 
and B) (18). The expression of known cell cycle 
markers followed the expected pattern relative 
to the Geminin-GFP and Cdt1-RFP (78), whereas 
the housekeeping gene HPRTI displayed con- 
stant expression during the cell cycle (Fig. 2A 
and fig. S3C). The level of labeled transcripts 
of cell cycle-controlled genes changed as a 
function of the cell cycle (fig. S4 and fig. $5), 
with the total UMI counts per cell approxi- 
mately doubling during one cell cycle (fig. S6, 
A and B). These results suggest that our es- 
timation of the cell cycle progression in single 
cells is accurate. 

To fit « and y to the experimental datasets, 
we simulated the dynamics of the pulse and 
chase experiments and quantified the accu- 
racy of the fitting procedures. The simulations 
defined the range of « and y values for which 
we can accurately determine these rates, and 
demonstrated that a model that does not as- 
sume steady-state dynamics of gene expres- 
sion is more fitting for our datasets (fig. S7). 

Next, we used the cell cycle progression es- 
timates to pool cells from different EU-labeling 
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time points and fitted « and y using the non- 
steady-state model (Fig. 2B and fig. S8) on 528 
genes that showed high expression changes 
during the cell cycle (fig. S9). Separate fits of 
the pulse and chase experiments with the non- 
steady-state model resulted in accurate y values, 
but simultaneous fits of both experiments fur- 
ther reduced uncertainty (Fig. 2B and fig. S10, 
A and B). We averaged the degradation rates 


A 1. EU incubation & biotinylation 2.FACS 3. reverse transcription 


over the cell cycle and obtained good agreement 
between our data and a published dataset 
(Spearman 7 = 0.585, fig. S10C) (5). The es- 
timated values of « and y for the 528 genes 
allowed us to predict their transcript levels at 
any position during cell cycle progression. Our 
predictions matched measurements of CEL- 
seq2 (19), an independent single-cell mRNA- 
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Fig. 1. Single-cell EU RNA sequencing (scEU-seq). (A) Schematics of the scEU-seq workflow. (B) Boxplots 
showing UMI counts of labeled mRNAs per well either containing single cells or left empty. Cell types are 
indicated. No cutoff was applied to the UMI counts per well. Signal-to-noise ratios are 49.74 (P = 2.4 x 10°*) 
for K562 cells, 19.69 (P = 2.4 x 10°’”) for FUCCI-expressing RPEI cells, and 32.48 (P = 1.7 x 10°”) for 
cells derived from mouse intestinal organoids; P values are from a Mann-Whitney U test. Estimated cross- 
contamination rates per well are 37.31 + 4.11, 21.13 + 1.80, and 8.26 + 0.87 (mean UMIs per cell + SEM) 
for K562, RPE1-FUCCI, and organoid cells, respectively. (C) Design of pulse and chase experiments. 

(D) Boxplots showing UMI counts of labeled mRNAs per cell. RPE1-FUCCI cells were treated with EU for 
the indicated times. (E) As in (D), but RPE1-FUCCI cells were treated with EU for 22 hours and then washed 
and treated with U for the indicated times. Cells shown in (D) and (E) were filtered as described in (15). 
(F) Fractions of UMI of labeled mRNAs containing unspliced introns for RPE1-FUCCI cells. 
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along the cell cycle (median correlation of 
0.730 for 528 genes; Fig. 2C and fig. S10D). 

We found widespread changes of both syn- 
thesis and degradation rates during the cell 
cycle (Fig. 2, B and C). Clustering of the ex- 
pression levels, synthesis rates, and degrada- 
tion rate constants with self-organizing maps 
(SOMs) revealed distinct strategies of mRNA 
regulation during the cell cycle (Fig. 2, B and 
D). To find common properties between the 
different strategies regardless of the position 
of the expression peak, we computed the co- 
sine similarity between synthesis and degra- 
dation dynamics (Fig. 2B and fig. S11A). We 
observed three types of regulatory strategies 
during the cell cycle: cooperative, neutral, and 
destabilizing. The cooperative strategy describes 
an increase in the synthesis rate that is ac- 
companied by a decrease in the degradation 
rate constant and vice versa, thus having a 
negative cosine similarity. The neutral strategy 
is characterized by small relative changes in 
the degradation rate constant compared with 
the synthesis rate. The destabilizing strategy is 
characterized by a simultaneous increase or 
decrease of the synthesis rate and the degra- 
dation rate constant, resulting in positive co- 
sine similarity. Among the groups of genes 
that follow the cooperative strategy, we found 
a subset of genes that have an expression peak 
in Gg and are involved in microtubule spin- 
dle assembly and mitosis regulation (strategy 
group B); genes with a functional enrichment 
for signaling and protein phosphorylation 
(group F); and genes that are expressed during 
S phase and are involved in DNA replication, 
repair, and maintenance (group D; Fig. 2C and 
fig. S11B). We validated the changes of the 
degradation rate constant during the cell cycle 
for these genes (groups B, D, and F) by per- 
forming a bulk chase experiment of 500 pooled 
cells gated for G,, S, or Gp (Pearson 7 between 
0.290 and 0.611; fig. SIIC). Genes that follow 
the neutral strategy were functionally enriched 
in microtubule activity (group A), homology 
recombination repair (group E), or cytokine 
activity, G,/S transition, and DNA replication 
initiation (group C; Fig. 2C and fig. S11B). 
Further simulations in which we varied the 
regimes of « and y throughout the cell cycle 
(fig. S12, A to C) validated that we can accu- 
rately determine the strategy type for most 
of the parameter combinations tested (fig. 
S12, D to G). These results indicate that genes 
with similar cellular functions tend to be con- 
trolled by similar strategies and may share 
posttranscriptional regulators. 

We next investigated the change in pre- 
dicted expression by assuming either a model 
with constant y or constant « (Fig. 2, D and 
E). These constants were chosen to match the 
expression averaged over the cell cycle as ob- 
served in the experimental data. The model 
assuming dynamic y and « could accurately 
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predict the dynamic range and the expression 
timing of our CEL-seq2 dataset. However, 
when either rate was assumed constant, we 
observed changes in both expression prop- 
erties. As expected, the synthesis rate « had 
an impact on the dynamic range of all genes 
(Fig. 2F), whereas the impact on timing was 
strongest for the strategy groups A and G, 
which show constant degradation rates during 
the cell cycle (Fig. 2G). Strong effects on the 
dynamic range could be observed by assuming 
a constant y for the groups of genes showing 
cooperative strategies (groups B, D, and F; Fig. 


Fig. 2. scEU-seq reveals mRNA control 
strategies during the cell cycle. (A) Scatter 
plot of the Geminin-GFP- and Cdtl-RFP- 
corrected signals of RPE1-FUCCI cells 

(n = 5422 cells). Expression levels (total 

UMI counts per cell) of four example genes 
are indicated in blue. (B) Clustered heat 
maps of estimated synthesis and degradation 
rates. Leftmost panel shows the cosine 
similarity (S) between the rates (n = 

528 genes). (C) As in (B) but showing 


2F). In addition, for three of the four clusters 
in group B, the effects on expression timing 
obtained by assuming constant degradation 
were similar to the results obtained by the 
constant synthesis model (Fig. 2G). These re- 
sults imply that the degradation and synthe- 
sis rates are coordinated to achieve precise 
expression dynamics during the cell cycle. 
Next, we asked whether scEU-seq could re- 
veal similar mRNA regulatory strategies dur- 
ing cellular differentiation. We used intestinal 
organoids expressing the GFP-Lgr5 reporter 
in intestinal stem cells (20, 27) (Fig. 3A) and 


log,,(fluorescence), Cdt1 RFP > 


ies] 
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performed a pulse experiment using an EU 
incubation time of 120 min and chase experi- 
ments with a 0-, 45-, or 360-min U chase phase. 
The UMAP (Uniform Manifold Approximation 
and Projection for Dimension Reduction) rep- 
resentation of the 3831 cells that passed quality 
controls places stem cells in the center and 
shows two branches representing the differen- 
tiation trajectories of enterocytes and secretory 
cells (Fig. 3, B to D). We did not observe batch 
effects between the experiments (fig. S13A). 
The expression of the GFP-Lgr5 fusion closely 
matches the measured expression of the Lgr5 
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Fig. 3. mRNA control strategies during intestinal organoid differentiation. 
(A) Schematics of intestinal organoid crypts. (B) UMAP showing the expression 
levels of Lgr5-GFP (green) in 3831 cells from intestinal organoids. (©) UMAP 
showing clusters of cells with similar gene expression (SOM analysis, cluster 
number indicated) and their respective cell identity. (D) UMAP showing the 
expression levels (blue) of four genes that are markers for stem cells, Paneth 
cells, enterocytes, and tuft cells, respectively. (E) UMAP showing the Monocle2 
differentiation branches 1 and 2, as indicated by arrows. Colors indicate the 
monocle trajectory values. (F) Heat maps of the observed expression levels along 
the two differentiation branches, the secretory lineage into Paneth cells (branch 1) 
and the enterocyte lineage (branch 2) (n = 301 genes). Red dots mark the position 
of housekeeping genes. (G) As in (F) but showing the estimated synthesis (left 
panels) and degradation rates (right panels). Genes are clustered and the cosine 
similarity is indicated independently for the two branches. Strategies with strong 
changes in the synthesis and the degradation rates are highlighted in (I) and (J). 
(H) Density plot of the peak timing distance against the dynamic range of the 
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predicted relative to the observed expression for models with dynamic synthesis 
and degradation rates (black, left), constant synthesis rate (blue, middle), and 
constant degradation rate (red, right) [n = 72 genes with strong cooperative and 
destabilizing strategies (groups A, B C, D and E); see (I) and (J)]. Top panels 
compare the distributions of peak timing distances (blue versus black: P = 6.29 x 
10, red versus black: P = nonsignificant, Wilcoxon test). Rightmost panel 
compares the distributions of dynamic ranges (blue versus black: P = 0.021, red 
versus black: P = 0.043, F test for variance, n = 72 genes). (land J) Delta values of 
data shown in (H) for genes with strong cooperative and destabilizing strategies 
(groups A, B C, D, and E). Shown are differences in dynamic range and timing, 
respectively, between the constant synthesis model [blue in (H)] and the full 
dynamic model [black in (H)] and between the constant degradation model 

[red in (H)] and the full dynamic model [black in (H)]. (K) Mean synthesis and 
degradations rates for genes in group A. (L) Mean synthesis and degradation rates 
for genes in group B. (M) Total number of UMIs detected for genes in group A. 
(N) Total number of UMIs detected for genes in group B. 
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transcript (Fig. 3, B and D). We clustered 
single-cell expression levels using SOMs and 
identified stem cells (clusters 8 and 11), fully 
differentiated cells (clusters 1, 3, 5, 9, and 10), 
and potentially intermediate stages (clusters 
10, 4, 6, and 2; Fig. 3C). 

We selected 295 genes that showed a high 
coefficient of variation compared with the mean 
expression and differences in expression be- 
tween the stem cells and cells in intermediate 
or differentiated stages (fig. S13, B and C). We 
used monocle2 (22) to sort cells along the 
differentiation trajectories of secretory cells 
(branch 1) and enterocytes (branch 2; Fig. 3E 
and fig. S14, A to C). We added six housekeeping 
genes as a control and calculated the synthesis 
rates and degradation rate constants through- 
out differentiation for these 301 genes (fig. S14, 
Dto F). 

We used SOMs to cluster genes by their ex- 
pression level, synthesis rates, and degrada- 
tion rate constants. The results demonstrate 
that cells use both the synthesis and degra- 
dation rate to control gene expression during 
differentiation. The housekeeping genes clus- 
tered separately from differentially regulated 
genes between branches 1 and 2 (Fig. 3, F and 
G, and fig. SI5A). For 24% of genes (72 genes), 
the degradation rate changed during differ- 
entiation, whereas the synthesis rate increased, 
displaying cooperative and destabilizing strat- 
egies, respectively, as observed for the cell cycle. 
Among the genes with destabilizing strategies 
(group A), we identified functional enrichment 
for oxidoreductase activity and drug metabo- 
lism (fig. S15B). These genes were up-regulated 
in enterocytes and six of them belong to the 
cytochrome P450 family localized to the endo- 
plasmic reticulum (23). Gene groups with co- 
operative strategies (group D and B) were 
enriched for genes encoding secreted proteins 
or components of the endoplasmic reticulum 
or Golgi complex (fig. SI5B). 

When we analyzed the gene expression dy- 
namics as before, assuming a constant synthe- 
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sis rate, we observed that the dynamic range 
decreased and the expression timing and dy- 
namics changed (Fig. 3H). By contrast, a con- 
stant degradation rate had little effect on the 
timing of the expression peak but increased 
the variance of the dynamic range; although 
the dynamic range of genes with destabiliz- 
ing strategies increased (groups A and E), it 
decreased for genes with cooperating strat- 
egies (groups B and D; Fig. 3, I and J). In agree- 
ment with this, we found that the absolute 
increase in the synthesis rate of genes in group A 
was higher than those in group B (Fig. 3, K 
and L), whereas the expression levels in both 
groups changed along the differentiation tra- 
jectory with similar dynamics and magnitude 
(Fig. 3, M and N). This effect is explained by 
the stabilization of transcripts in group B 
toward the end of differentiation branch 2 
(Fig. 3L and fig. S15C). 

Here, we show that cells use cooperative, 
neutral, or destabilizing strategies to actively 
regulate gene expression during the cell cycle 
and during differentiation. Both synthesis and 
degradation rates control the accuracy and 
precision of the dynamic range and the timing 
of the expression peak. By contrast, during 
differentiation, the degradation rate seems to 
affect only the dynamic range of expression, 
whereas the timing is fully encoded by the 
dynamics of the mRNA synthesis rate. Thus, 
our data support findings that the modula- 
tion of mRNA degradation rates plays a role 
in mammalian cellular homeostasis such as 
T cell homeostasis (24) and differentiation of 
mammalian embryonic stem cells (25). 
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Flow Reactor for Kilogram-Scale Production 
The FlowSyn Maxi from Uniaqsis is a high- 
throughput flow reactor for process chemists 
looking to optimize reaction conditions before 
scaling-up synthetic production to multiple 
kilograms per day. It combines the benefits 

of Uniqsis’ proven FlowSyn technology with 
high flow rates (up to 100 mL/min) in a compact, benchtop design. 
The highly intuitive user interface quickly guides you through the 
process for setting up a flow chemistry reaction. Once set up, the 
reactor automatically runs the experiment unattended, ensuring 
that critical parameters remain within defined limits. Robustly 
constructed from epoxy-coated and bead-blasted stainless steel, 
it can be specified with stainless steel, Hastelloy, or polytetra- 
fluoroethylene (PTFE) flow paths. The unit is fitted with larger-bore 
inlet tubing and upgraded inlet/outlet solenoid valves with bigger 
fluidic channels to cope with the increased flow rate. The FlowSyn 
Maxi is designed to carry out superheated reactions up to +260°C 
(1,400 psi) and cooled reactions down to -70°C—making it one of 
the most versatile process-scale flow reactors available. 

Uniqsis 

For info: +44-(0)-845-864-7747 
www.uniqsis.com/paproductsdetail.aspx?id=flw_maxi 


Knockout Cell Lines 

CRISPR-Cas9 knockout cell lines from Abcam provide you with reli- 
able, off-the-shelf, single-gene knockouts, so that you can confidently 
interrogate the relationship between genotype and phenotype 
without spending months establishing your own knockout cell line. 
We provide a large and expanding range of knockouts in immortal- 
ized mammalian cell lines, such as HeLa and HEK293T. All knockout 
cell lines are Sanger sequenced, and many have additional Western 
blot data to confirm knockout at the proteomic level. With our large 
range of ready-made knockouts in commonly used cell lines, and 
our ever-expanding range of new targets and cell lines, obtaining 
your knockout cell line of choice is as easy as buying a reagent. Don’t 
see the cell line you need in our catalog? Register your interest for 
your desired knockout cell line and we will be in touch if it’s in our 
pipeline. 

Abcam 

For info: 888-772-2226 

www.abcam.com 


Recombinant Antibodies for In Vivo Research 

Absolute Antibody offers VivopureX recombinant mouse antibod- 
ies for in vivo research in mouse models. The collection consists 

of popular antibody clones, many originally obtained from rats or 
hamsters, which Absolute Antibody has engineered into mouse 
anti-mouse recombinant versions to improve research results. The 
antibodies are available in discounted bulk sizes ranging from 

1 mg to 100 mg. VivopureX antibodies are species-matched chimeric 
antibodies, consisting of a clone’s original antigen-binding variable 
domain with a mouse-constant domain, which means they do not 
induce neutralizing antibodies in mouse models. In addition, the 
antibodies feature engineered effector functions, with fragment 
crystallizable (Fc) receptor binding tailored to best suit popular ap- 
plications such as depletion, agonism, or blocking. All antibodies are 
produced recombinantly for ensured batch-to-batch reproducibility 


LIFE SCIENCE TECHNOLOGIES 


new products 


and offer high purity and low endotoxin levels ideal for in vivo ap- 
plications. The antibodies are targeted against key immune system 
proteins, including clinically relevant checkpoint proteins such as 
PD-1, CTLA-4, and OX40. 

Absolute Antibody 

For info: 617-377-4057 

absoluteantibody.com 


Heparanase and Hyaluronidase Activity Kits 

AMS Biotechnology supplies Razie assay kits to measure enzyme ac- 
tivity of heparanase and hyaluronidase in cell culture supernatants, 
human plasma, biological fluids, and tissue samples. Our kits have 
been used for research into anti-inflammatory responses, kidney 
disease, cancer research, and intervertebral disc degeneration. Hep- 
aranase and hyaluronidase enzymes play an important role in the 
extracellular matrix. Our kits provide a solution to the performance 
and standardization problems found with traditional measurement 
techniques used for these types of enzymes. Each assay is available 
in two formats: (1) with enzyme as positive control to demonstrate 
digestion/activity, and (2) without positive control, for end-users who 
have their own enzyme for use as positive control. 

AMS Biotechnology 

For info: 800-987-0985 

www.amsbio.com 


Antigen and Antibody Conjugation Kits 

The Native Antigen Company offers a selection of complementary, 
easy-to-use antibody conjugation kits, with a wide range of labels 
suitable for use in most applications. On request we can also under- 
take custom conjugation of proteins and antibodies. The kits utilize 
a novel chemistry to generate highly reproducible antigen and anti- 
body conjugates with a range of different labels, including enzymes, 
fluorochromes, and biotin. No reconstitution is required, and these 
kits can be used to generate liquid stable conjugates, which are fully 
scalable from 0.01 mg to gram scale. Incorporation is highly efficient, 
helping to ensure lot-to-lot consistency, low backgrounds, and im- 
proved assay sensitivity. 

The Native Antigen Company 

For info: +44-(0)-1865-595230 
thenativeantigencompany.com/conjugation-kits 


PQA Portable Quadrupole Analyzer 

The Hiden pQA portable gas analyzer is a versatile mass spectrom- 
eter and offered with a range of interchangeable sampling inlets to 
suit a broad application range. Membrane introduction mass spec- 
trometry (MIMS) inlets are offered for analysis of dissolved species 
in ground water, fermentation cultures, soil samples, and general 
applications where analysis of dissolved species in a liquid sample 
is required. The system is suited to gas analysis applications, where 
sample volume is small, and for environmental applications where 
detection of low concentration levels is required. The pQA system 
has a mass range of 200 atomic mass units (amu) and sub-ppb 
detection levels; extended mass range to 300 amu is optional. The 
system is supplied in a Pelican case and can be powered by a 12-V 
supply for field use, battery and/or solar powered, or a 220-V supply 
for laboratory use. 

Hiden Analytical 

For info: +44-(0)-1925-445225 

www.hidenanalytical.com 


Electronically submit your new product description or product literature information! Go to www.sciencemag.org/about/new-products-section for more information. 


Newly offered instrumentation, apparatus, and laboratory materials of interest to researchers in all disciplines in academic, industrial, and governmental organizations are featured in this 
space. Emphasis is given to purpose, chief characteristics, and availability of products and materials. Endorsement by Science or AAAS of any products or materials mentioned is not 
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FOCUS ON DIVERSITY AND INCLUSION 


Chair, Biochemistry and 
Structural Biology 


The University of Texas Health Science Center 
San Antonio, dba as UT Health San Antonio 

. (UTHSA), seeks a dynamic academic leader 
to serve as Chair of the Department of Biochemistry and Structural Biology 
(BSB) in the Long School of Medicine. UTHSA is the only academic health 
center in South Texas and combines scientific discovery with clinical care in 
innovative and transformative ways 


Reporting to the Dean, the Chair of BSB is responsible for the overall vision 
and strategic plan for the department and oversight of administrative, academic 
and research activities of the Department. This is a unique opportunity to lead 
an outstanding basic science department with strategic vision and direction. 
UTHSA views diversity as a core value which embodies inclusiveness, 
mutual respect, and multiple perspectives and serves as a catalyst for change 
resulting in health equity. Inclusion fosters innovation, creativity and a sense 
of belonging—reflecting differing ideas, beliefs, experiences, perspectives and 
backgrounds—that lead to biomedical discovery and scientific advancement. 
The UT Health San Antonio is an Equal Employment Opportunity/A ffirmative 
Action Employer and is committed to excellence through diversity among 
its faculty, staff and students including protected veterans and persons with 
disabilities. 


oy UT Health 


Candidates must have a PhD and/or MD, and academic credentials that merit 
appointment at the rank of Professor with tenure. The ideal candidate will have 
a widely recognized national reputation in their field. The ability to foster a 
culture of collaboration, innovation, and accountability across the Health 
Science Center is important. This is a wonderful opportunity for a visionary 
leader. UT Health San Antonio is a research-intensive institution located in San 
Antonio and it sits in the gateway to the picturesque Texas Hill Country. San 
Antonio is a vibrant, multicultural city with year-round recreational activities 
and an attractive cost-of-living. 

To Apply: submit a letter of interest along with a current CV electronically via 
the UT Health Careers portal at https://uthscsa.referrals.selectminds.com/ 
faculty/jobs/chair-department-of-biochemistry-structural-biology-2851. 
Inquiries and to receive a full job prospectus email Shelly Evans, Long School 
of Medicine Chief of Staff at elliss@uthscsa.edu. 


All faculty appointments are designated as security sensitive positions. 


Mae Director, Institute for Integration 
UT Health of 


San Antonio Medicine and Science 


The University of Texas Health Science Center 
San Antonio, dba UT Health San Antonio (UTHSA), is seeking a dynamic 
academic leader to serve as its Director of the Institute for Integration of 
Medicine & Science (IIMS), the academic home for UTHSA’s Clinical and 
Translational Science Award from the National Institutes of Health. 


UTHSA is a comprehensive academic health university and research center 
in South Texas. The Director of the IIMS reports directly to the Dean of the 
Long School of Medicine, and is responsible for the overall administrative, 
academic, and research activities of the IIMS. This is a unique opportunity 
to lead an outstanding research institute with strategic vision and direction. 
UT Health San Antonio combines scientific discovery with clinical care in 
innovative and transformative ways. 


The UT Health San Antonio is an Equal Employment Opportunity/A ffirmative 
Action Employer and is committed to excellence through diversity among 
its faculty, staff and students including protected veterans and persons with 
disabilities. 


Candidates must have a MD and/or PhD, and academic credentials that merit 
appointment at the rank of Professor with tenure. The ideal candidate will have 
a widely-recognized national reputation in their field. The ability to foster 
a culture of collaboration, innovation, and accountability across the Health 
Science Center is important. This is a wonderful opportunity for a visionary 
leader. UT Health San Antonio is a research-intensive institution located in 
San Antonio and it sits in the gateway to the picturesque Texas Hill Country. 
San Antonio is a vibrant, multicultural city with year-round recreational 
activities and an attractive cost-of-living. 


To Apply: submit a letter of interest along with a current CV electronically via 
the UT Health Careers portal at https://uthscsa.referrals.selectminds.com/ 
faculty/jobs/director-institute-for-integration-of-medicine-science-2845. 
Inquiries and to receive a full job prospectus email Shelly Evans, Long School 
of Medicine Chief of Staff at elliss@uthscsa.edu. 


All faculty appointments are designated as security sensitive positions. 


Director, Greehey Children’s 
Cancer Research 
Institute 


oO: UT Health 


San Antonio 


oe B.S Teresa Lozano Long The Long School of Medicine at the 
PEG Malin University of Texas Health Science 
Center at San Antonio (dba UT Health San Antonio) seeks an individual 
with an outstanding record of leadership, scientific and educational 
achievement, and faculty/trainee recruitment and mentoring as its Director, 
Greehey Children’s Cancer Research Institute (GCCRI). Dynamic 
leadership, clear communication, strong interpersonal skills, and a keen 
strategic vision are crucial characteristics required for success. Reporting to 
the Dean of the Long School of Medicine, the Director will be responsible 
for continuing to build and maintain excellence in research programs, high 
quality educational programs, and a collegial interactive culture. 


The UT Health San Antonio is an Equal Employment Opportunity/ 
Affirmative Action Employer and is committed to excellence through 
diversity among its faculty, staff and students including protected veterans 
and persons with disabilities. 


Candidates must have a PhD and/or MD, and academic experience 
consistent with eligibility for full Professor with tenure. The ideal candidate 
will have a widely-recognized national reputation in their field. The ability 
to foster a culture of collaboration, innovation, and accountability across the 
Health Science Center is important. This is a wonderful opportunity for a 
visionary leader. UT Health San Antonio is a research-intensive institution 
located in San Antonio and it sits in the gateway to the picturesque Texas 
Hill Country. San Antonio is a vibrant, multicultural city with year-round 
recreational activities and an attractive cost-of-living. 


To Apply: submit a letter of interest along with a current CV electronically 
via the UT Health Careers portal at https://uthscsa.referrals.selectminds. 
com/faculty/jobs/director-of-greehey-childrens-cancer-research- 
institute-2763. Inquiries and to receive a full job prospectus email Shelly 
Evans, Long School of Medicine Chief of Staff at elliss@uthscsa.edu. 


All faculty appointments are designated as security sensitive positions. 
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ARL Distinguished Postdoctoral Fellowships 


The Army Research Laboratory (ARL) Distinguished Postdoctoral 
Fellowships provide opportunities to pursue independent research in 
ARL laboratories. Fellows benefit by working alongside some of the 
nation’s best scientists and engineers, while enhancing the mission 
and capabilities of the U.S. Army and the warfighter in times of both 
peace and war. 


ARL invites exceptional young researchers to apply. Fellows must 
display extraordinary abilities in scientific research and show clear 
promise of becoming future leaders. Candidates are expected to 
have already successfully tackled a major scientific or engineering 
problem or to have provided a new approach or insight, evidenced by 
a recognized impact in their field. ARL offers five named Fellowships; 
two of these positions are open for the 2020 competition. 


Fellowships are one-year appointments, renewable for up to three 
based on performance. The award includes a $100,000 annual 
stipend, health insurance, paid relocation, and a professional travel 
allowance. Applicants must have completed all requirements 
for a Ph.D. or Sc.D. degree by October 1, 2020, and may not 
be more than five years beyond their doctoral degree as of the 
application deadline. For more information and to apply, visit 
www.nas.edu/arldpf. 


Online applications must be submitted by May 29, 2020 at 5 PM EST. 
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Faculty Position in Host-Pathogen Interactions and Microbial Immunology 
Department of Microbiology-Immunology 


M Northwestern Medicine 
Feinberg Schoo! of Medicine 


The Department of Microbiology-Immunology at Northwestern University Feinberg School of Medicine 
seeks a full-time, tenure-track Investigator at the rank of Assistant, Associate, or Full Professor within the 
broad field of host-pathogen interactions and microbial immunology. Applicants working at the interface 
between bacteriology and immunology or virology and immunology are particularly encouraged to apply. 


Qualified candidates are expected to have a Ph.D., M.D., or equivalent degree as well as postdoctoral 
training or equivalent status. The successful candidate will be expected to establish and maintain a 
vigorous, extramurally funded research program and to participate in our strong graduate and medical 
student training programs. All applicants should have substantial peer-reviewed publications that 
demonstrate productivity and the ability to perform cutting edge research. Candidates for an Assistant 
Professor position should have current or pending external funding, which could include an NIH K-level 
award or equivalent. Candidates at the Associate Professor or Full Professor level should have substantial 
research productivity, current grant support and academic service. The primary criteria for selection will 
be excellence and creativity in research and scholarship. We offer a highly interactive collegial research 
environment with state-of-the-art research and core facilities. 


The start date is negotiable, and the position will remain open until filled. 


Please read ALL instructions and make preparations before proceeding to the application page: 
Applications will only be accepted via online submission (see link below). 
Please prepare all documents in advance as Adobe PDF files, and please be sure all information 
is entered correctly and accurately (especially names and email addresses), as there will be no 
opportunity for online revision after your application has been submitted. 
All required fields in the application form are marked with an asterisk and must be filled before 
clicking the “Submit” button. 

* Be aware that incomplete applications cannot be saved. 


Applications accepted here: https://facultyrecruiting.northwestern.edu/apply/ODg1 


Northwestern University is an Equal Opportunity, Affirmative Action Employer of all protected classes, 
including veterans and individuals with disabilities. Women, racial and ethnic minorities, individuals 
with disabilities, and veterans are encouraged to apply. Hiring is contingent upon eligibility to work in 
the United States. 
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The University of Texas Health Science Center 
San Antonio, dba as UT Health San Antonio 
(UTHSA), is seeking a dynamic, academic 
leader to serve as its Chair of the Department 
of Microbiology, Immunology & Molecular 
Genetics (MIMG). 


The Department of Microbiology, Immunology 
& Molecular Genetics (MIMG) is the nucleus 
for research and education in immunology 
and microbiology at UT Health San Antonio 
and provides a dynamic environment for 
scientific discovery and training. This is a 
unique opportunity to lead an outstanding 
basic science department with strategic vision 
and direction. Reporting to the Dean of the 
Long School of Medicine, the Chair will 
be responsible for overall administrative, 
academic and research activities of this strong 
department. 


Features in myIDP include: 


= Exercises to help you examine your 
skills, interests, and values. 


= Alist of 20 scientific career paths 
with a prediction of which ones best 
fit your skills and interests. 


Visit the website and start 


myDP planning today! 

CJ myIDP.sciencecareers.org 
For more information, contact Abby Walsh at 
. abby@grantcooper.com or 314-449-1599. 
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The UT Health Science Center San Antonio 
aVaas dba as UT Health San Antonio (UTHSA) 


is an Equal Employment Opportunity/ 
Affirmative Action Employer and is committed 
to excellence through diversity among its 
faculty, staff and students including protected 
veterans and persons with disabilities. All 
faculty appointments are designated as security 
sensitive positions. 
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Applications Requested for Stand Up 
To Cancer (SU2C) Convergence Grants 
Exploring Cancer’s Communications 
among the Immune System and the 
Microbiome Using Organoids and Mice 


Goal: develop a set of experimental platforms 
using organoid systems in vitro, and mouse 
models, that will permit a detailed exploration 
and understanding of the interactions and 
chemical communications among the 
cancerous/normal organoid, the immune 
system, and the microbiome. 


Convergence 3.1416 aims to bring together 

several independent research laboratories with 

complementary skill sets and technological 

approaches: 

¢ Laboratories able to make and maintain 
organoids from multiple types of human or 
mouse tissues; 
Cellular immunologists exploring the innate 
immune system and the adaptive immune 
system; 
Virologists and microbiologists who study 
the microbiomes of various tissues; 
Individuals using mass spectroscopy to 
characterize changes in metabolism and to 
identify metabolites that act as signaling 
agents between organ systems; 
Researchers who use multiplex mass 
spectroscopy of peptides to identify proteins 
that could be communicating among 
microbes, lymphocytes, and tissues of the 
body; 
Engineers who can create “chemostats” for 
organoids and their fellow lymphoid and 
microbial travelers, with both aerobic and 
anaerobic compartments; 
Quantitative biologists and physicists 
studying diffusion of nutrients and signals 
for communicating, who can analyze large 
data sets of information collected from 
sequencing, microscopy, and time lapse 
filming of cultures. 


Applications: A letter of intent describing 
current research directions, capabilities, and a 
list of recent publications, information about 
present position, institution, and laboratory 
size and scope. Current funding should 
be included, as well as any conflicts of 
interest. Please submit your applications to: 
StandUpToCancer.org/Convergence by 
June 1, 2020. Questions can be addressed to: 
alustig@su2c.org. 


Notification: Applications will be screened 
by a committee, and 10-15 investigators will 
be invited to a three day ideas laboratory to 
be held at the Institute for Advanced Study, 
in Princeton, New Jersey, at which invited 
investigators will give a short talk about their 
work and capabilities, and break up into groups 
to determine if collaborative efforts can be 
planned. Following this meeting, a subset of 
the group will be selected to write proposals 
ofno more than 10 pages, and supply a budget 
for a three to four year period. The review 
committee will determine which proposals are 
acceptable for funding. 
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FOCUS ON DIVERSITY AND INCLUSION 


— ILLINOIS 


Now recruiting faculty and postdocs to help improve 
photosynthesis for increased yields at the 
University of Illinois 
The Grainger College of Engineering at the University of Illinois at 
Urbana-Champaign invites applications for up to two full-time tenured 
or tenure-track faculty positions in the broad area of photosynthesis 
engineering. The application closes on March 20, 2020. Learn more and 

apply at http://bit.ly/2wltmlb. 


The Carl R. Woese Institute for Genomic Biology at Illinois has an 
open call for postdoctoral research fellows from a variety of fields 
and disciplines to assist with research efforts such as RIPE (https://ripe. 
illinois.edu), ROGUE (https://rogue.illinois.edu), and CABBI (https:// 
cabbi.bio) that are supported by the Bill & Melinda Gates Foundation, the 


U.S. Department of Energy, and others. Applicants must have obtained a 
Ph.D. (or equivalent) in the last 8 years in molecular biology, mathematical 
modeling of biological systems, plant physiology, crop sciences, or related 
fields. We are now accepting applications to fill at least seven vacancies 
but will consider adding positions to recruit more top talent to Illinois. 
Labs with openings include Donald R. Ort, Stephen P. Long, and Carl J. 
Bernacchi. To be given full consideration, applications must be received 
by April 13, 2020, at http://bit.ly/polysynpostdoc. 

The University of Illinois is a world leader in photosynthesis research and 
has contributed to some of the most seminal discoveries in photosynthesis. 
Here we seek to build upon and expand our leadership in this area. 
Illinois is uniquely positioned to capitalize on the traditional strengths 
in photosynthesis research with the emerging engineering/technological 
capabilities associated with genomics and DNA editing, and modeling 
whole plant-environment interactions. 

Illinois is an Equal Opportunity, Affirmative Action Employer. Minorities, 

women, veterans, and individuals with disabilities are encouraged to 
apply. For more information, visit http://go.illinois.edu/EEO. 


POSITIONS OPEN 


: KEK 
HIGH ENERGY ACCELERATOR 
RESEARCH ORGANIZATION 


Call for Nomination for Next Director General of KEK 


KEK, High Energy Accelerator Research Organization, invites nominations for 
the next Director General whose term will begin April 1, 2021. 


In view of his/her role that presides over the business of KEK as a representative 
of the Inter-University Research Institute Corporation, nominees shall be: 

(1) persons of noble character, with relevant knowledge and experience and 
having abilities to manage its educational and research activities properly and 
effectively. 

(2) persons expected to promote with long-term vision and strong scientific 
leadership, the highly advanced, internationalized, and inter-disciplinary 
research activities of KEK by getting support from the public. 

(3) persons expected to establish and carry out the medium-term goals and plans. 


The term of appointment is three years until March 31, 2024 and shall be 
eligible for reappointment only twice. Thus, he/she may not remain in office 
continuously over a period 9 years. We widely accept the nomination of 
the candidates regardless of their nationalities. We would like to ask you to 
recommend the best person who satisfies requirements for the position written 
above. 


Nomination should be accompanied by: (1) letter of recommendation, (2) 
brief personal history of the candidate, and (3) list of major achievements 
(publications, academic papers, commendations and membership of councils, 
etc.). The nomination should be submitted to the following address no later 
than May 29, 2020: 

* Documents should be written either in English or in Japanese. 

+ Forms are available at: https://www.kek.jp/en/newsroom/2020/03/02/1000/ 


Inquiries concerning the nomination should be addressed to: General Affairs 
Division, General Management Department, KEK, High Energy Accelerator 
Research Organization, 1-1 Oho, Tsukuba, Ibaraki 305-0801, Japan; Tel +81- 
29-864-5114 * Fax +81-29-864-5560; Email: kek.dgsc@ml.post.kek.jp. 


FOCUS ON DIVERSITY AND INCLUSION 


LSU 


SHREVEPORT 


The Department of Cellular Biology and Anatomy at the LSU Health Sciences 
Center School of Medicine in Shreveport is currently seeking a candidate 
holding a Ph.D. or equivalent degree for a tenure-track position at the 
level of Assistant and Associate Professor. To be successful, the candidate 
should have expertise in the area of the Anatomical Sciences/Cellular 
Biology. Candidates having established research expertise in vascular 
biology, especially in the areas of stroke and/or the neurobiological sequeliae 
associated with stroke will be given special consideration. 


This position is part of a continued expansion of the research program in the 
department. The requirements for Assistant Professor include a minimum of 
two years of post-doctoral research experience and for Associate Professor 
a minimum of five years of experience at the level of Assistant Professor. 
Applicants are expected to have a record of scholarly and research activity. 
The successful candidate should also have teaching experience in Gross 
Anatomy, and in one or more of the following disciplines: Cellular Biology, 
Neurosciences, or Histology. For the Associate Professor level, extramural 
funding (NIH RO1 or equivalent) is expected. Salary and start-up packages 
will be commensurate with experience. 


This is an exciting opportunity for the appropriate candidate to play a 
significant role in an active, dynamic work environment. It is also expected 
that the new faculty member will take an active role in the continued growth 
of the Center for Cardiovascular Diseases and Sciences. 


Please submit application with full curriculum vitae and names of three 
references via email to: Dr. Kevin McCarthy (kmccar2@lsuhsc.edu), 
Professor and Chair, Department of Cellular Biology and Anatomy, LSU 
Health Sciences Center, School of Medicine in Shreveport, Shreveport, 
LA 71130. Please include with your packet a letter that summarizes your 
future research plans. Review of applications will begin immediately and 
will continue until the position is filled. 


LSU Health Shreveport is committed to diversity and is an Equal 
Opportunity/Equal Access Employer. 


TENURE TRACK POSITIONS 
DEPARTMENT OF BIOCHEMISTRY AND 
MOLECULAR BIOLOGY 
VIRGINIA COMMONWEALTH UNIVERSITY SCHOOL OF MEDICINE 


The Department of Biochemistry and Molecular Biology at Virginia 
Commonwealth University School of Medicine has openings for two 
tenure-track positions. The candidates should have a Ph.D. and strong 
background in cellular and molecular biology, with emphasis on bioactive 
lipids, cancer, cardiovascular disease, or neuroscience. The successful 
candidate will be required to have a well-developed scholarly/research 
portfolio with evidence of multi-disciplinary applications and external 
funding appropriate to complement and expand existing expertise in the 
department. At the associate or full professor level, successful applicants 
will have an outstanding publication record, complemented by substantial 
extramural support. At the assistant professor level, positions require a 
strong publication record, and postdoctoral experience with potential for, 
or evidence of, the ability to secure extramural funding. As promoted by 
VCU, the faculty member should also demonstrate experience working 
in and fostering a diverse faculty, staff, and student environment or 
commitment to do so as a faculty member at VCU. The Department 
belongs to a very active community of investigators and is committed 
to providing an outstanding research environment. Interested candidates 
should apply online at https://www.vcujobs.com. 


Interested candidates should apply at https://www.vcujobs.com 
Search Job: F65930 & F65940 


Virginia Commonwealth University is an Equal Opportunity, Affirmative 
Action University providing access to education and employment 
without regard to age, race, color, national origin, gender, religion, 
sexual orientation, veteran status, political affiliation or disability. 


Welcome back to hometown. 
Thousands of academic job vacancies are in fast-developing China. 


2020 Global Online Job Fair 
arch 13, 2020 High-level Global Talents Recruitment 
arch 25, 2020 Northeast China Doctoral Talents Recruitment 


> 


pril 09, 2020 Southwest China Doctoral Talents Recruitment 


> 


pril 17, 2020 Southeast China Doctoral Talents Recruitment 


> 


pril 24, 2020 Specialty Session (Engineering 


ay 08, 2020 North and Northeast China Doctoral Talents Recruitment 


ay 09, 2020 Hong Kong, Macao and Taiwan Doctoral Talents Recruitment 


ay 15, 2020 High-level Global Talents Recruitment 


Qualification for Applicants 
Global scholars, Doctor and Post-doctor 


Key Disciplines 


Life Sciences, Medicine, Material Sciences, Physical Sciences 


Participating Approach : 
, Please send your CV to Comms Reaimu@ eteelebatel fe, clulern FO 
| 2020 Global Online Job Fair iad 


Scan the QR code to apply for 


2020 Global Online Job Fair 
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WORKING LIFE 


By Ellen K. W. Brennan 


1162 


Learning to say “yes, and’ 


stood behind the curtain, my hands shaking. As I listened to the crowd laugh at the host’s jokes, 
all I could think was, “Why did I believe I could do this?” Two months earlier, I had signed up 
for improv classes at a local theater, along with four other grad students. We hoped that improv 
training—which involves acting out unscripted scenes—would improve our science communica- 
tion skills. That sounded great in principle, and our program would pay the fees. But when the 
host gave us the cue to come on stage, I seriously wondered what I’d gotten myself into. 


When I first heard about the improv 
classes, I was torn. As an introvert, I 
feared getting on stage and impro- 
vising in front of strangers. How- 
ever, I knew I wanted to work as a 
science communicator after finish- 
ing my Ph.D., so it seemed like the 
perfect opportunity to improve my 
speaking ability and gain confidence 
thinking on my feet. I signed up, 
knowing the experience would take 
me well outside my comfort zone. 

During our first class, we learned 
a core concept of improv: “yes, and.” 
It means that, as improvisers, we 
accept what fellow performers say. 
If someone says that rhinos are 
librarians, for example, then rhinos 
are librarians. We do not question 
the logic; we say “yes” and continue 
with the scene as if nothing is awry. 

To do this effectively, our teacher 
warned us that we’d have to avoid second-guessing our- 
selves. Sometimes scenes go in unexpected directions. The 
best improv happens when performers stay open to differ- 
ent possibilities and say whatever pops into their minds. 

I got a taste of how difficult that is when acting out my 
first scene. My classmate turned to me and said, “Mom is 
going to be so mad.” Mad about what? My mind spun out 
ideas, and my inner critic shot them all down. We broke the 
car? No, that’s too easy. We failed a test? No, you don’t want 
your classmates thinking you're stupid on the first day. I 
finally landed on an answer: “Yes, we’re going to be late for 
dinner.” The scene proceeded from there, and we eventually 
finished as two sisters who lost their way on a hiking trail. 

The first few scenes were hard, but as weeks turned 
into months, I became more comfortable thinking on my 
feet and even started to enjoy our classes. I never silenced 
my inner critic entirely, but over time, I didn’t police my 
words with quite so much vigor. I also became better at 
listening, relating to my conversation partners, and com- 
municating clearly in the moment. 


“| feared getting 
on stage and improvising 
in front of strangers.” 


That training came in handy 
6 months ago, when I was giving a 
seminar about my science. An audi- 
ence member surprised me with a 
question that didn’t grow out of the 
information I’d presented. Instead 
of getting flustered, I implemented 
the “yes, and” approach—accepting 
the question at face value and let- 
ting my mind focus on why it was 
asked. That helped me shift gears 
and find an appropriate answer. 

The benefits of improv go beyond 
communication. Early on in grad 
school, I would get stuck when my 
experiments generated unexpected 
data; my inner critic would assume 
I had made a mistake. But now, af- 
ter embracing the “yes, and” concept, 
I no longer go into an experiment 
thinking that I already know the 
story my data are going to tell. 

Last year, I used that approach after encountering con- 
fusing data. Instead of getting discouraged, I kept exploring 
the data and ended up identifying a new type of cell—one 
that wasn’t behaving as expected. If I hadn’t stayed open 
to the possibility that the results were real, I would have 
missed out on the most exciting finding of my Ph.D. so far. 

All scientists can benefit from this lesson. If the data say 
rhinos are librarians, then it’s worth investigating whether 
rhinos are, in fact, librarians. Our job as scientists isn’t to 
generate data that support a preconceived story. Our job is 
to say “yes, and.” 

As for that debut performance, I must admit that I can- 
not remember much of it. In front of the deafeningly loud 
crowd, it went by in a blur. But I do recall that when we left 
the stage, my face ached from grinning. That very night, I 
registered for the next level of classes—excited to discover 
what else I’d learn from improv. 


Ellen K. W. Brennan is a Ph.D. candidate at the University of Michigan, 
Ann Arbor. Send your career story to SciCareerEditor@aaas.org. 
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